PCT 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 
C12N 15/12, C07K 14/715, 16/28 



A1



(11) International Publication Number: WO 98/49307 

(43) International Publication Date: 5 November 1998 (05.11.98)



(21) International Application Number: PCT/US98/08865 

(22) International Filing Date: 1 May 1998 (01.05.98) 



(30) Priority Data: 

60/045,287    1 May 1997 (01.05.97)         US
08/850,030    1 May 1997 (01.05.97)         US
09/023,890    13 February 1998 (13.02.98)   US
60/074,721    13 February 1998 (13.02.98)   US



(71) Applicant: ZYMOGENETICS, INC. [US/US]; 1201 Eastlake 

Avenue East, Seattle, WA 98102 (US). 

(72) Inventors: LOK, Si; 806 N.W. 52nd, Seattle, WA 98107 (US). 

PRESNELL, Scott, R.; 919 North 83rd Street, Seattle, WA 
98103 (US). JELMBERG, Anna, C; 170 2nd Avenue N.W. 
#203, Issaquah, WA 98027 (US). GILBERT, Teresa; 246 
12th Street S.E., Seattle, WA 98002 (US). WHITMORE, 
Theodore, E.; 6916 152nd Avenue N.E., Redmond, WA 
98052 (US). FOSTER, Donald, C; 3002 N.E. 181st Street, 
Seattle, WA 98155 (US). ADAMS, Robyn, L.; 14418 S.E. 
15th Street, Bellevue, WA 98007 (US). LEHNER, Joyce, 
M.; 6522 Phinney Avenue North #201, Seattle, WA 98103 
(US). 



(74) Agent: LUNN, Paul, G.; ZymoGenetics, Inc., 1201 Eastlake 
Avenue East, Seattle, WA 98102 (US). 



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR, 
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, GE, 
GH, HU, IL, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, 
LS, LT, LU, LV, MD, MG, MK, MN, MW, MX, NO, NZ, 
PL, PT, RO, RU, SD, SE, SG, SI, SK, TJ, TM, TR, TT,
UA, UG, UZ, VN, ZW, ARIPO patent (GH, GM, KE, LS, 
MW, SD, SZ, UG, ZW), Eurasian patent (AM, AZ, BY, 
KG, KZ, MD, RU, TJ, TM), European patent (AT, BE, CH, 
CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, 
PT, SE), OAPI patent (BF, BJ, CF, CG, CI, CM, GA, GN, 
ML, MR, NE, SN, TD, TG). 



Published 

With international search report. 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: MAMMALIAN CYTOKINE-LIKE RECEPTOR 
(57) Abstract 

Novel receptor polypeptides, polynucleotides encoding the polypeptides, and related compositions. The polypeptides of the present 
invention can be used to down-regulate their natural ligands. The polynucleotides and subsequences thereof can be used as diagnostic 
probes to determine if chromosome 19 is mutated. The antibodies which bind to the polypeptides can be used to purify the receptors and 
to inhibit the binding of the ligands onto the receptors. 






FOR THE PURPOSES OF INFORMATION ONLY 
Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL  Albania
AM  Armenia
AT  Austria
AU  Australia
AZ  Azerbaijan
BA  Bosnia and Herzegovina
BB  Barbados
BE  Belgium
BF  Burkina Faso
BG  Bulgaria
BJ  Benin
BR  Brazil
BY  Belarus
CA  Canada
CF  Central African Republic
CG  Congo
CH  Switzerland
CI  Côte d'Ivoire
CM  Cameroon
CN  China
CU  Cuba
CZ  Czech Republic
DE  Germany
DK  Denmark
EE  Estonia
ES  Spain
FI  Finland
FR  France
GA  Gabon
GB  United Kingdom
GE  Georgia
GH  Ghana
GN  Guinea
GR  Greece
HU  Hungary
IE  Ireland
IL  Israel
IS  Iceland
IT  Italy
JP  Japan
KE  Kenya
KG  Kyrgyzstan
KP  Democratic People's Republic of Korea
KR  Republic of Korea
KZ  Kazakstan
LC  Saint Lucia
LI  Liechtenstein
LK  Sri Lanka
LR  Liberia
LS  Lesotho
LT  Lithuania
LU  Luxembourg
LV  Latvia
MC  Monaco
MD  Republic of Moldova
MG  Madagascar
MK  The former Yugoslav Republic of Macedonia
ML  Mali
MN  Mongolia
MR  Mauritania
MW  Malawi
MX  Mexico
NE  Niger
NL  Netherlands
NO  Norway
NZ  New Zealand
PL  Poland
PT  Portugal
RO  Romania
RU  Russian Federation
SD  Sudan
SE  Sweden
SG  Singapore
SI  Slovenia
SK  Slovakia
SN  Senegal
SZ  Swaziland
TD  Chad
TG  Togo
TJ  Tajikistan
TM  Turkmenistan
TR  Turkey
TT  Trinidad and Tobago
UA  Ukraine
UG  Uganda
US  United States of America
UZ  Uzbekistan
VN  Viet Nam
YU  Yugoslavia
ZW  Zimbabwe


WO 98/49307 



PCT/US98/08865 



MAMMALIAN CYTOKINE-LIKE RECEPTOR

BACKGROUND OF THE INVENTION 



Proliferation and differentiation of cells of
multicellular organisms are controlled by hormones and
polypeptide growth factors. These diffusible molecules
allow cells to communicate with each other and act in
concert to form cells and organs, and to repair and
regenerate damaged tissue. Examples of hormones and
growth factors include the steroid hormones (e.g.,
estrogen, testosterone), parathyroid hormone, follicle
stimulating hormone, the interleukins, platelet derived
growth factor (PDGF), epidermal growth factor (EGF),
granulocyte-macrophage colony stimulating factor (GM-CSF),
erythropoietin (EPO) and calcitonin.

Hormones and growth factors influence cellular
metabolism by binding to receptors. Receptors may be
integral membrane proteins that are linked to signaling
pathways within the cell, such as second messenger
systems. Other classes of receptors are soluble
molecules, such as the transcription factors.

Of particular interest are receptors for
cytokines, molecules that promote the proliferation and/or
differentiation of cells. Examples of cytokines include
erythropoietin (EPO), which stimulates the development of
red blood cells; thrombopoietin (TPO), which stimulates
development of cells of the megakaryocyte lineage; and
granulocyte-colony stimulating factor (G-CSF), which
stimulates development of neutrophils. These cytokines
are useful in restoring normal blood cell levels in
patients suffering from anemia or receiving chemotherapy
for cancer. The demonstrated in vivo activities of these
cytokines illustrate the enormous clinical potential of,
and need for, other cytokines, cytokine agonists, and
cytokine antagonists.

SUMMARY OF THE INVENTION 

The present invention addresses this need by
providing a novel mammalian cytokine-like receptor called
mammalian Zcytor5, and related compositions and methods.
Within one aspect, the present invention provides an
isolated human polynucleotide encoding a ligand-binding
human receptor polypeptide. The polypeptide comprises a
sequence of amino acids containing (a) the amino acid
residues of SEQ ID NO: 17, residues 35 to 422 of SEQ ID
NO: 2; (b) allelic variants of (a); and (c) sequences that
are at least 90%, 95% or 99% identical to (a) or (b). In
an alternative embodiment, the polypeptide is comprised of
amino acid residues 30 to and including amino acid residue
422 of SEQ ID NO: 2.

The present invention also provides for a
polynucleotide encoding another allelic variant of SEQ ID
NO: 2, which is a human polypeptide receptor and is defined
by SEQ ID NO: 4; in particular, the polypeptide is comprised
of a sequence of amino acids containing (a) the amino acid
residues of SEQ ID NO: 18, residues 34 to 425 of SEQ ID
NO: 4; (b) allelic variants of (a); and (c) sequences that
are at least 90%, 95% or 99% identical to (a) or (b). In
an alternative embodiment, the polypeptide is comprised of
amino acid residues 29 to and including amino acid residue
425 of SEQ ID NO: 4.

Other polynucleotides of the present invention
encode the amino acid sequence of SEQ ID NO: 21, which is a
soluble receptor of SEQ ID NO: 17 that does not contain a
C-terminus phosphatidylinositol signal sequence; the amino
acid sequence of SEQ ID NO: 20, which is a Zcytor5
polypeptide of SEQ ID NO: 2 having an alternative
N-terminus cleavage site; SEQ ID NO: 22, which has an
alternative N-terminus cleavage site of the Zcytor5
polypeptide of SEQ ID NO: 4; SEQ ID NO: 23, which is an
amino acid sequence of SEQ ID NO: 18 that does not contain
a C-terminus phosphatidylinositol signal sequence; and the
amino acid sequences defined by SEQ ID NOs: 24-31, which
are variants of the Zcytor5 polypeptide of SEQ ID NO: 4.



Another embodiment of the present invention is a
polynucleotide which encodes rat Zcytor5. In particular, a
polynucleotide is claimed which encodes a rat polypeptide
containing (a) the amino acid sequence of SEQ ID NO: 19,
residues 41 to 425 of SEQ ID NO: 6; (b) allelic variants of
(a); and (c) sequences that are at least 90%, 95% or 99%
identical to (a) or (b).



Within a second aspect of the invention there is
provided an expression vector comprising (a) a
transcription promoter; and (b) a DNA segment encoding a
secretory peptide and a ligand-binding Zcytor5 receptor
polypeptide containing an amino acid sequence as
described above.



Within a third aspect of the invention there is
provided a cultured eukaryotic cell into which has been
introduced an expression vector as disclosed above,
wherein said cell expresses a mammalian Zcytor5 receptor
polypeptide encoded by the DNA segment.



Within a fourth aspect of the invention there is
provided an isolated polypeptide. The polypeptide
comprises a sequence of amino acids containing (a) the
amino acid sequence of SEQ ID NO: 17, residues 35 to 422
of SEQ ID NO: 2; (b) allelic variants of (a); and (c)
sequences that are at least 90%, 95% or 99% identical to
(a) or (b). In an alternative embodiment, the polypeptide
is comprised of amino acid residues 30 to and including
amino acid residue 422 of SEQ ID NO: 2.

The present invention also provides for another
allelic variant of SEQ ID NO: 2, which is a human
polypeptide receptor and is defined by SEQ ID NO: 4; in
particular, the polypeptide is comprised of a sequence of
amino acids containing (a) the amino acid sequence of SEQ
ID NO: 18, residues 34 to 425 of SEQ ID NO: 4; (b) allelic
variants of (a); and (c) sequences that are at least 90%,
95% or 99% identical to (a) or (b). In an alternative
embodiment, the polypeptide is comprised of residues 29 to
425 of SEQ ID NO: 4.

Another embodiment of the present invention is a
rat Zcytor5 polypeptide containing (a) the amino acid
sequence of SEQ ID NO: 19, residues 41 to 425 of SEQ ID
NO: 6; (b) allelic variants of (a); and (c) sequences that
are at least 80% identical to (a) or (b).

Within a further aspect of the invention there
is provided a chimeric polypeptide consisting essentially
of a first portion and a second portion joined by a
peptide bond. The first portion of the chimeric
polypeptide consists essentially of a Zcytor5 receptor
polypeptide as described above. The invention also
provides expression vectors encoding the chimeric
polypeptides and host cells transfected to produce the
chimeric polypeptides.



The invention also provides a method for
detecting a ligand within a test sample, comprising
contacting a test sample with a Zcytor5 polypeptide as
disclosed above, and detecting binding of the polypeptide
to ligand in the sample. The polypeptide can be membrane
bound within a cultured cell, wherein the detecting step
comprises measuring a biological response in the cultured
cell. Within another embodiment, the polypeptide is
immobilized on a solid support.

Within an additional aspect of the invention
there is provided an antibody that specifically binds to a
polypeptide as disclosed above, and an anti-idiotypic
antibody which specifically binds to a Zcytor5 antibody.
Also provided is a method for producing an antibody to
Zcytor5.

An additional embodiment of the present
invention relates to a peptide or polypeptide which has
the amino acid sequence of an epitope-bearing portion of a
Zcytor5 polypeptide having an amino acid sequence
described above. Peptides or polypeptides having the amino
acid sequence of an epitope-bearing portion of a Zcytor5
polypeptide of the present invention include portions of
such polypeptides with at least nine, preferably at least
15 and more preferably at least 30 to 50 amino acids,
although epitope-bearing polypeptides of any length up to
and including the entire amino acid sequence of a
polypeptide of the present invention described above are
also included in the present invention. Examples of said
polypeptides are defined by the amino acid sequences of
SEQ ID NOs: 32-37.

DETAILED DESCRIPTION OF THE INVENTION 

The teachings of all of the references cited in
the present specification are incorporated in their
entirety herein by reference.

The term "affinity tag" is used herein to denote
a polypeptide segment that can be attached to a second
polypeptide to provide for purification or detection of
the second polypeptide or provide sites for attachment of
the second polypeptide to a substrate. In principle, any
peptide or protein for which an antibody or other specific
binding agent is available can be used as an affinity tag.
Affinity tags include a poly-histidine tract, protein A
[Nilsson et al., EMBO J. 4:1075 (1985); Nilsson et al.,
Methods Enzymol. 198:3 (1991)], glutathione S transferase
[Smith and Johnson, Gene 67:31 (1988)], Glu-Glu affinity
tag [Grussenmeyer et al., Proc. Natl. Acad. Sci. USA
82:7952-4 (1985)], substance P, FLAG™ peptide [Hopp et
al., Biotechnology 6:1204-10 (1988)], streptavidin binding
peptide, or other antigenic epitope or binding domain.
See, in general, Ford et al., Protein Expression and
Purification 2:95-107 (1991). DNAs encoding affinity
tags are available from commercial suppliers (e.g.,
Pharmacia Biotech, Piscataway, NJ).

The term "allelic variant" denotes any of two or 
more alternative forms of a gene occupying the same 
chromosomal locus. Allelic variation arises naturally 
through mutation, and may result in phenotypic 
polymorphism within populations. Gene mutations can be 
silent (no change in the encoded polypeptide) or may 
encode polypeptides having altered amino acid sequence. 
The term allelic variant is also used herein to denote a 
protein encoded by an allelic variant of a gene. 


The terms "amino-terminal" and "carboxyl-terminal"
are used herein to denote positions within
polypeptides. Where the context allows, these terms are
used with reference to a particular sequence or portion of
a polypeptide to denote proximity or relative position.
For example, a certain sequence positioned
carboxyl-terminal to a reference sequence within a
polypeptide is located proximal to the carboxyl terminus
of the reference sequence, but is not necessarily at the
carboxyl terminus of the complete polypeptide.

The term "complement/anti-complement pair"
denotes non-identical moieties that form a non-covalently
associated, stable pair under appropriate conditions. For
instance, biotin and avidin (or streptavidin) are
prototypical members of a complement/anti-complement pair.
Other exemplary complement/anti-complement pairs include
receptor/ligand pairs, antibody/antigen (or hapten or
epitope) pairs, sense/antisense polynucleotide pairs, and
the like. Where subsequent dissociation of the
complement/anti-complement pair is desirable, the
complement/anti-complement pair preferably has a binding
affinity of <10⁹ M⁻¹.

The term "complements of a polynucleotide
molecule" denotes a polynucleotide molecule having a
complementary base sequence and reverse orientation as
compared to a reference sequence. For example, the
sequence 5' ATGCACGGG 3' is complementary to 5' CCCGTGCAT
3'.
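
The definition above can be illustrated with a minimal Python sketch. The function name and the lookup table are illustrative assumptions, not part of the specification; the sketch simply pairs each base with its Watson-Crick partner and reverses the orientation.

```python
# Watson-Crick pairing for the four DNA bases (illustrative helper,
# not taken from the specification).
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the complement of a DNA sequence, written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


# The example sequence from the text above.
print(reverse_complement("ATGCACGGG"))  # CCCGTGCAT
```

Applying the function twice returns the original sequence, which is a quick sanity check on the pairing table.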

The term "contig" denotes a polynucleotide that
has a contiguous stretch of identical or complementary
sequence to another polynucleotide. Contiguous sequences
are said to "overlap" a given stretch of polynucleotide
sequence either in their entirety or along a partial
stretch of the polynucleotide. For example,
representative contigs to the polynucleotide sequence
5'-ATGGCTTAGCTT-3' are 5'-TAGCTTgagtct-3' and
3'-gtcgacTACCGA-5'.
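
As a rough illustration of the overlap described above, the following Python sketch measures how many bases two sequences share at their junction. The helper names are hypothetical and the approach (longest suffix/prefix match) is an assumption chosen for simplicity, not a method stated in the specification.

```python
# Illustrative helpers (hypothetical names) for the contig example above.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement(seq: str) -> str:
    """Base-by-base complement; a strand written 3'->5' yields its partner written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in seq.upper())


def suffix_prefix_overlap(left: str, right: str) -> int:
    """Length of the longest suffix of `left` that is also a prefix of `right`."""
    left, right = left.upper(), right.upper()
    for size in range(min(len(left), len(right)), 0, -1):
        if left.endswith(right[:size]):
            return size
    return 0


reference = "ATGGCTTAGCTT"            # 5'->3'
identical_contig = "TAGCTTgagtct"     # 5'->3', overlaps the 3' end of the reference
complementary_contig = "gtcgacTACCGA" # written 3'->5', pairs with the 5' end

downstream = suffix_prefix_overlap(reference, identical_contig)
upstream = suffix_prefix_overlap(complement(complementary_contig), reference)
print(downstream, upstream)
```

Both example contigs share a six-base stretch with the reference: one by identity at its 3' end, the other by complementarity at its 5' end.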

The term "degenerate nucleotide sequence"
denotes a sequence of nucleotides that includes one or
more degenerate codons (as compared to a reference
polynucleotide molecule that encodes a polypeptide).
Degenerate codons contain different triplets of
nucleotides, but encode the same amino acid residue (i.e.,
GAU and GAC triplets each encode Asp).
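
A small Python sketch may make the idea of degenerate codons concrete. It uses only a six-entry excerpt of the standard genetic code, and the function names are illustrative assumptions rather than anything defined in the specification.

```python
# Excerpt of the standard genetic code (RNA codons); deliberately incomplete.
CODON_TABLE = {
    "GAU": "Asp", "GAC": "Asp",
    "GAA": "Glu", "GAG": "Glu",
    "UGG": "Trp", "AUG": "Met",
}


def synonymous_codons(codon: str) -> list[str]:
    """All codons in the excerpt that encode the same residue as `codon`."""
    residue = CODON_TABLE[codon]
    return sorted(c for c, aa in CODON_TABLE.items() if aa == residue)


def translate(rna: str) -> list[str]:
    """Translate an RNA string codon by codon using the excerpt table."""
    return [CODON_TABLE[rna[i:i + 3]] for i in range(0, len(rna), 3)]


def encodes_same_peptide(rna_a: str, rna_b: str) -> bool:
    """True when two sequences differ only by degenerate (synonymous) codons."""
    return translate(rna_a) == translate(rna_b)


print(synonymous_codons("GAU"))                        # ['GAC', 'GAU']
print(encodes_same_peptide("AUGGAUUGG", "AUGGACUGG"))  # True
```

A full implementation would need all 64 codons; the excerpt is enough to show that GAU and GAC are interchangeable without altering the encoded residue.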



The term "expression vector" is used to denote a
DNA molecule, linear or circular, that comprises a segment
encoding a polypeptide of interest operably linked to
additional segments that provide for its transcription.
Such additional segments include promoter and terminator
sequences, and may also include one or more origins of
replication, one or more selectable markers, an enhancer,
a polyadenylation signal, etc. Expression vectors are
generally derived from plasmid or viral DNA, or may
contain elements of both.



The term "isolated", when applied to a
polynucleotide, denotes that the polynucleotide has been
removed from its natural genetic milieu and is thus free
of other extraneous or unwanted coding sequences, and is
in a form suitable for use within genetically engineered
protein production systems. Such isolated molecules are
those that are separated from their natural environment
and include cDNA and genomic clones. Isolated DNA
molecules of the present invention are free of other genes
with which they are ordinarily associated, but may include
naturally occurring 5' and 3' untranslated regions such as
promoters and terminators. The identification of
associated regions will be evident to one of ordinary
skill in the art (see, for example, Dynan and Tjian, Nature
316:774-78 (1985)).

An "isolated" polypeptide or protein is a
polypeptide or protein that is found in a condition other
than its native environment, such as apart from blood and
animal tissue. In a preferred form, the isolated
polypeptide is substantially free of other polypeptides,
particularly other polypeptides of animal origin. It is
preferred to provide the polypeptides in a highly purified
form, i.e., greater than 95% pure, more preferably greater
than 99% pure. When used in this context, the term
"isolated" does not exclude the presence of the same
polypeptide in alternative physical forms, such as dimers
or alternatively glycosylated or derivatized forms.

"Operably linked", when referring to DNA
segments, indicates that the segments are arranged so that
they function in concert for their intended purposes, e.g.,
transcription initiates in the promoter and proceeds
through the coding segment to the terminator.

The term "ortholog" (or "species homolog")
denotes a polypeptide or protein obtained from one species
that has homology to an analogous polypeptide or protein
from a different species.

The term "paralog" denotes a polypeptide or
protein obtained from a given species that has homology to
a distinct polypeptide or protein from that same species.

A "polynucleotide" is a single- or double-stranded
polymer of deoxyribonucleotide or ribonucleotide
bases read from the 5' to the 3' end. Polynucleotides
include RNA and DNA, and may be isolated from natural
sources, synthesized in vitro, or prepared from a
combination of natural and synthetic molecules.

A "polypeptide" is a polymer of amino acid
residues joined by peptide bonds, whether produced
naturally or synthetically. Polypeptides of less than
about 10 amino acid residues are commonly referred to as
"peptides".

The term "promoter" is used herein for its
art-recognized meaning to denote a portion of a gene
containing DNA sequences that provide for the binding of
RNA polymerase and initiation of transcription. Promoter
sequences are commonly, but not always, found in the 5'
non-coding regions of genes.

A "protein" is a macromolecule comprising one or
more polypeptide chains. A protein may also comprise
non-peptidic components, such as carbohydrate groups.
Carbohydrates and other non-peptidic substituents may be
added to a protein by the cell in which the protein is
produced, and will vary with the type of cell. Proteins
are defined herein in terms of their amino acid backbone
structures; substituents such as carbohydrate groups are
generally not specified, but may be present nonetheless.

The term "receptor" denotes a cell-associated
protein that binds to a bioactive molecule (i.e., a
ligand) and mediates the effect of the ligand on the cell.
Membrane-bound receptors are characterized by a
multi-domain structure comprising an extracellular
ligand-binding domain and an intracellular effector domain
that is typically involved in signal transduction. Binding of
ligand to receptor results in a conformational change in
the receptor that causes an interaction between the
effector domain and other molecule(s) in the cell. This
interaction in turn leads to an alteration in the
metabolism of the cell. Metabolic events that are linked
to receptor-ligand interactions include gene
transcription, phosphorylation, dephosphorylation,
increases in cyclic AMP production, mobilization of
cellular calcium, mobilization of membrane lipids, cell
adhesion, hydrolysis of inositol lipids and hydrolysis of
phospholipids. In general, receptors can be membrane
bound, cytosolic or nuclear; monomeric (e.g., thyroid
stimulating hormone receptor, beta-adrenergic receptor) or
multimeric (e.g., PDGF receptor, growth hormone receptor,
IL-3 receptor, GM-CSF receptor, G-CSF receptor,
erythropoietin receptor and IL-6 receptor).

The term "secretory signal sequence" denotes a
DNA sequence that encodes a polypeptide (a "secretory
peptide") that, as a component of a larger polypeptide,
directs the larger polypeptide through a secretory pathway
of a cell in which it is synthesized. The larger
polypeptide is commonly cleaved to remove the secretory
peptide during transit through the secretory pathway.

The term "splice variant" is used herein to
denote alternative forms of RNA transcribed from a gene.
Splice variation arises naturally through use of
alternative splicing sites within a transcribed RNA
molecule, or less commonly between separately transcribed
RNA molecules, and may result in several mRNAs transcribed
from the same gene. Splice variants may encode
polypeptides having altered amino acid sequence. The term
splice variant is also used herein to denote a protein
encoded by a splice variant of an mRNA transcribed from a
gene.

A "soluble receptor" is a receptor polypeptide
that is not bound to a cell membrane. Soluble receptors
are most commonly ligand-binding receptor polypeptides
that lack transmembrane and cytoplasmic domains. Soluble
receptors can comprise additional amino acid residues,
such as affinity tags that provide for purification of the
polypeptide or provide sites for attachment of the
polypeptide to a substrate, or immunoglobulin constant
region sequences. Many cell-surface receptors have
naturally occurring, soluble counterparts that are
produced by proteolysis or translated from alternatively
spliced mRNAs. Receptor polypeptides are said to be
substantially free of transmembrane and intracellular
polypeptide segments when they lack sufficient portions of
these segments to provide membrane anchoring or signal
transduction, respectively.

The present invention is based in part upon the
discovery of a novel DNA sequence that encodes a protein
having the structure of a cytokine receptor, including the
conserved WSXWS motif. Analysis of the tissue
distribution of the mRNA corresponding to this novel DNA
showed that expression was present in highest amounts in
placenta, thyroid, heart and skeletal muscle with lower
levels in prostate and trachea.

Cytokine receptor subunits are characterized by
a multi-domain structure comprising a ligand-binding
domain and an effector domain that is typically involved
in signal transduction. Multimeric cytokine receptors
include homodimers (e.g., PDGF receptor αα and ββ
isoforms, erythropoietin receptor, MPL [thrombopoietin
receptor], and G-CSF receptor), heterodimers whose
subunits each have ligand-binding and effector domains
(e.g., PDGF receptor αβ isoform), and multimers having
component subunits with disparate functions (e.g., IL-2,
IL-3, IL-4, IL-5, IL-6, IL-7, and GM-CSF receptors). Some
receptor subunits are common to a plurality of receptors.
For example, the AIC2B subunit, which cannot bind ligand
on its own but includes an intracellular signal
transduction domain, is a component of IL-3 and GM-CSF
receptors. Many cytokine receptors can be placed into one
of four related families on the basis of their structures
and functions. Hematopoietic receptors, for example, are
characterized by the presence of a domain containing
conserved cysteine residues and the WSXWS motif.
Additional domains, including protein kinase domains;
fibronectin type III domains; and immunoglobulin domains,
which are characterized by disulfide-bonded loops, are
present in certain hematopoietic receptors. Cytokine
receptor structure has been reviewed by Urdal, Ann.
Reports Med. Chem. 26:221-228 (1991) and Cosman, Cytokine
5:95-106 (1993). It is generally believed that under
selective pressure for organisms to acquire new biological
functions, new receptor family members arose from
duplication of existing receptor genes leading to the
existence of multi-gene families. Family members thus
contain vestiges of the ancestral gene, and these
characteristic features can be exploited in the isolation
and identification of additional family members. The
cytokine receptor superfamily is subdivided as shown in
Table 1.

Table 1

Cytokine Receptor Superfamily

Immunoglobulin family
    CSF-1 receptor
    MGF receptor
    IL-1 receptor
    PDGF receptor
Hematopoietin family
    erythropoietin receptor
    G-CSF receptor
    IL-2 receptor β-subunit
    IL-3 receptor
    IL-4 receptor
    IL-5 receptor
    IL-6 receptor
    IL-7 receptor
    IL-9 receptor
    GM-CSF receptor α-subunit
    GM-CSF receptor β-subunit
    Prolactin receptor
    CNTF receptor
    Oncostatin M receptor
    Leukemia inhibitory factor receptor
    Growth hormone receptor
    MPL
    Leptin receptor
TNF receptor family
    TNF (p80) receptor
    TNF (p60) receptor
    TNFR-RP
    CD27
    CD30
    CD40
    4-1BB
    OX-40
    Fas
    NGF receptor
Other
    IL-2 receptor α-subunit
    IL-15 receptor α-subunit
    IFN-γ receptor

Cell-surface cytokine receptors are further
characterized by the presence of additional domains.
These receptors are anchored in the cell membrane by a
transmembrane domain characterized by a sequence of
hydrophobic amino acid residues (typically about 21-25
residues), which is commonly flanked by positively charged
residues (Lys or Arg). On the opposite end of the protein
from the extracellular domain and separated from it by the
transmembrane domain is an intracellular domain.

The novel receptor of the present invention was
initially identified by the presence of the conserved
WSXWS motif. Analysis of a human cDNA clone encoding
human Zcytor5 (SEQ ID NO: 1) revealed an open reading frame
encoding 422 amino acids (SEQ ID NO: 2); analysis of an
allelic variant (SEQ ID NO: 3) revealed an open reading
frame encoding 425 amino acid residues (SEQ ID NO: 4).
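
Screening a translated sequence for the WSXWS motif can be sketched with a regular expression, as below. This is only an illustration under stated assumptions: the function name is hypothetical, the test peptide is made up for the example, and the specification does not say which software was actually used for the screen.

```python
import re

# Trp-Ser-X-Trp-Ser, where X is any residue (one-letter amino acid codes).
WSXWS = re.compile(r"WS[A-Z]WS")


def find_wsxws(protein: str) -> list[tuple[int, str]]:
    """Return (1-based start position, matched residues) for each motif hit."""
    return [(m.start() + 1, m.group()) for m in WSXWS.finditer(protein.upper())]


# Made-up peptide for illustration only; not a sequence from the application.
print(find_wsxws("MKTLLWSEWSPAG"))  # [(6, 'WSEWS')]
```

Positions are reported 1-based to match the residue numbering convention used for the SEQ ID NOs in the text.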

Within preferred embodiments of the invention
the isolated polynucleotides will hybridize to similar
sized regions of SEQ ID NO: 1, SEQ ID NO: 3, or SEQ ID NO: 5,
or a sequence complementary thereto, under stringent
conditions. In general, stringent conditions are selected
to be about 5°C lower than the thermal melting point (Tm)
for the specific sequence at a defined ionic strength and
pH. The Tm is the temperature (under defined ionic
strength and pH) at which 50% of the target sequence
hybridizes to a perfectly matched probe. Typical
stringent conditions are those in which the salt
concentration is at least about 0.02 M at pH 7 and the
temperature is at least about 60°C. As previously noted,
the isolated polynucleotides of the present invention
include DNA and RNA. Methods for isolating DNA and RNA
are well known in the art. It is generally preferred to
isolate RNA from testis, including whole testis tissue
extracts or testicular cells, such as Sertoli cells,
Leydig cells, spermatogonia, or epididymis, although DNA
can also be prepared using RNA from other tissues or
isolated as genomic DNA. Total RNA can be prepared using
guanidine HCl extraction followed by isolation by
centrifugation in a CsCl gradient [Chirgwin et al.,
Biochemistry 18:52-94 (1979)]. Poly(A)+ RNA is prepared
from total RNA using the method of Aviv and Leder, Proc.
Natl. Acad. Sci. USA 69:1408-1412 (1972). Complementary
DNA (cDNA) is prepared from poly(A)+ RNA using known
methods. Polynucleotides encoding Zcytor5 polypeptides
are then identified and isolated by, for example,
hybridization or PCR.
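
The "about 5°C below Tm" guideline can be turned into a short worked example. The sketch below is an assumption-laden illustration: it estimates Tm with the Wallace rule (2°C per A/T, 4°C per G/C), a rough approximation that is only reasonable for short oligonucleotides and is not the method given in the specification, which defines Tm experimentally at a given ionic strength and pH.

```python
def wallace_tm(oligo: str) -> int:
    """Rough Tm estimate in deg C for a short oligo: 2 per A/T plus 4 per G/C."""
    seq = oligo.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def stringent_temperature(tm: float, offset: float = 5.0) -> float:
    """Hybridization temperature set `offset` deg C below Tm, per the text above."""
    return tm - offset


probe = "ATGCACGGG"  # short example sequence used earlier in the text
tm = wallace_tm(probe)
print(tm, stringent_temperature(tm))
```

For long probes, such as those spanning similar sized regions of the disclosed sequences, salt-dependent formulas or empirical measurement would be needed instead of this estimate.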



Those skilled in the art will recognize that the
sequences disclosed in SEQ ID NOs: 1, 2, 3 and 4 represent
single alleles of the human, and SEQ ID NOs: 5 and 6 of the
rat, Zcytor5 receptors. Allelic variants of these
sequences can be cloned by probing cDNA or genomic
libraries from different individuals according to standard
procedures.

The present invention further provides
counterpart receptors and polynucleotides from other
species ("species orthologs"). Of particular interest are
Zcytor5 receptors from other mammalian species, including
murine, porcine, ovine, bovine, canine, feline, equine,
and other primate receptors. Species orthologs of the
human and macaque Zcytor5 receptors can be cloned using
information and compositions provided by the present
invention in combination with conventional cloning
techniques. For example, a cDNA can be cloned using mRNA
obtained from a tissue or cell type that expresses the
receptor. Suitable sources of mRNA can be identified by
probing Northern blots with probes designed from the
sequences disclosed herein. A library is then prepared
from mRNA of a positive tissue or cell line. A
receptor-encoding cDNA can then be isolated by a variety of
methods, such as by probing with a complete or partial
human or macaque cDNA or with one or more sets of
degenerate probes based on the disclosed sequences. A
cDNA can also be cloned using the polymerase chain
reaction, or PCR (Mullis, U.S. Patent No. 4,683,202),
using primers designed from the sequences disclosed
herein. Within an additional method, the cDNA library can
be used to transform or transfect host cells, and
expression of the cDNA of interest can be detected with an
antibody to the receptor. Similar techniques can also be
applied to the isolation of genomic clones.

The polynucleotides of the present invention can
be synthesized using DNA synthesizers. Currently the
method of choice is the phosphoramidite method. If
chemically synthesized double-stranded DNA is required for
an application such as the synthesis of a gene or a gene
fragment, then each complementary strand is made
separately. The production of short genes (60 to 80 bp)
can be accomplished by synthesizing the complementary
strands and then annealing them. For the production of
longer genes (>300 bp), however, special strategies must
be invoked, because the coupling efficiency of each cycle
during chemical DNA synthesis is seldom 100%. To overcome
this problem, synthetic genes (double-stranded) are
assembled in modular form from single-stranded fragments
that are from 20 to 100 nucleotides in length.
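
The effect of imperfect coupling on longer syntheses can be
illustrated with a short calculation. The sketch below
assumes, as a simplification that is not stated in the text,
that every coupling cycle succeeds independently and with the
same efficiency, so that the fraction of chains reaching full
length for an n-nucleotide strand is the efficiency raised to
the power n - 1. The function name and the 99% efficiency
used in the loop are illustrative assumptions only.

```python
def full_length_fraction(length_nt: int, coupling_efficiency: float) -> float:
    """Fraction of chains that reach full length after stepwise synthesis.

    Assumption (not from the source): cycles are independent and share one
    efficiency; a strand of n nucleotides needs n - 1 coupling cycles.
    """
    if length_nt < 1:
        raise ValueError("length_nt must be at least 1")
    if not 0.0 <= coupling_efficiency <= 1.0:
        raise ValueError("coupling_efficiency must be between 0 and 1")
    return coupling_efficiency ** (length_nt - 1)


if __name__ == "__main__":
    # Illustrative lengths: fragment sizes and the ">300" case named in the text.
    for n in (20, 60, 100, 300):
        print(n, round(full_length_fraction(n, 0.99), 3))
```

Under these assumptions the full-length fraction falls
steadily with strand length, which is consistent with the
preference for assembling long genes from short fragments.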

One method for building a synthetic gene
requires the initial production of a set of overlapping,
complementary oligonucleotides, each of which is between
20 and 60 nucleotides long. The sequences of the strands
are planned so that, after annealing, the two end segments
of the gene are aligned to give blunt or staggered ends.
Each internal section of the gene has complementary 3' and
5' terminal extensions that are designed to base pair
precisely with an adjacent section. Thus, after the gene
is assembled, the only remaining requirement to complete
the process is sealing the nicks along the backbones of
the two strands with T4 DNA ligase. In addition to the
protein coding sequence, synthetic genes can be designed
with terminal sequences that facilitate insertion into a
restriction endonuclease site of a cloning vector. Other
sequences should also be added that contain signals
for the proper initiation and termination of transcription
and translation. See Glick, Bernard R. and Jack J.
Pasternak, Molecular Biotechnology, Principles &
Applications of Recombinant DNA (ASM Press, Washington,
D.C. 1994); Itakura, K. et al., Synthesis and use of
synthetic oligonucleotides. Annu. Rev. Biochem. 53:323-
356 (1984); and Climie, S. et al., Chemical synthesis of
the thymidylate synthase gene. Proc. Natl. Acad. Sci. USA
87:633-637 (1990).
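
The overlapping layout described above can be sketched in a
few lines. This is a minimal illustration rather than a
design tool: it assumes blunt ends, a fixed oligonucleotide
length (40 is an arbitrary choice within the 20 to 60
nucleotide range given above), and bottom-strand break
points offset by half an oligonucleotide so that no nick on
one strand lies opposite a nick on the other. The function
names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def design_oligos(gene: str, oligo_len: int = 40):
    """Split a gene into overlapping top- and bottom-strand oligonucleotides.

    Top-strand oligos are consecutive blocks of the coding strand. Bottom-strand
    oligos are written 5'->3' and their break points are shifted by half an
    oligo, so each internal oligo pairs with two oligos of the other strand.
    """
    if not gene:
        raise ValueError("gene must not be empty")
    if oligo_len < 2:
        raise ValueError("oligo_len must be at least 2")
    gene = gene.upper()
    half = oligo_len // 2
    top = [gene[i:i + oligo_len] for i in range(0, len(gene), oligo_len)]
    # Staggered cut positions along the gene for the complementary strand.
    cuts = [0] + list(range(half, len(gene), oligo_len)) + [len(gene)]
    bottom = [reverse_complement(gene[a:b]) for a, b in zip(cuts, cuts[1:])]
    return top, bottom


if __name__ == "__main__":
    top, bottom = design_oligos("ATGGCC" * 20, oligo_len=40)
    print(len(top), "top-strand oligos;", len(bottom), "bottom-strand oligos")
```

Restriction-site ends and transcription or translation
signals, which the text says should be added, are outside
the scope of this sketch.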

Another embodiment of the present invention
provides for a peptide or polypeptide comprising an
epitope-bearing portion of a polypeptide of the invention.
The epitope of this polypeptide portion is an
immunogenic or antigenic epitope of a polypeptide of the
invention. A region of a protein to which an antibody can
bind is defined as an "antigenic epitope". See, for
instance, Geysen, H.M. et al., Proc. Natl. Acad. Sci. USA
81:3998-4002 (1984).

As to the selection of peptides or polypeptides
bearing an antigenic epitope (i.e., that contain a region
of a protein molecule to which an antibody can bind), it
is well known in the art that relatively short synthetic
peptides that mimic part of a protein sequence are
routinely capable of eliciting an antiserum that reacts
with the partially mimicked protein. See Sutcliffe, J.G.
et al., Science 219:660-666 (1983). Peptides capable of
eliciting protein-reactive sera are frequently represented
in the primary sequence of a protein, can be characterized
by a set of simple chemical rules, and are confined
neither to immunodominant regions of intact proteins
(i.e., immunogenic epitopes) nor to the amino or carboxyl
terminals. Peptides that are extremely hydrophobic and
those of six or fewer residues generally are ineffective
at inducing antibodies that bind to the mimicked protein;
longer soluble peptides, especially those containing
proline residues, usually are effective.

Antigenic epitope-bearing peptides and
polypeptides of the invention are therefore useful to
raise antibodies, including monoclonal antibodies, that
bind specifically to a polypeptide of the invention.
Antigenic epitope-bearing peptides and polypeptides of the
present invention contain a sequence of at least nine,
preferably between 15 and about 30, amino acids contained
within the amino acid sequence of a polypeptide of the
invention. However, peptides or polypeptides comprising a
larger portion of an amino acid sequence of the invention,
containing from 30 to 50 amino acids, or any length up to
and including the entire amino acid sequence of a
polypeptide of the invention, also are useful for inducing
antibodies that react with the protein. Preferably, the
amino acid sequence of the epitope-bearing peptide is
selected to provide substantial solubility in aqueous
solvents (i.e., the sequence includes relatively
hydrophilic residues, and hydrophobic residues are
preferably avoided); sequences containing proline
residues are particularly preferred. All of the
polypeptides shown in the sequence listing contain
antigenic epitopes to be used according to the present
invention; however, specifically designed antigenic
epitopes include the peptides defined by SEQ ID NOs: 32-37.
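
The selection preferences stated above (a hydrophilic,
soluble peptide of a chosen length, with proline favored)
can be illustrated with a simple ranking sketch. It is not
the procedure used to design SEQ ID NOs: 32-37, which the
text does not describe. The use of the Kyte-Doolittle
hydropathy scale, the mean-hydropathy score, the proline
tie-break and the function name are all assumptions made
here for illustration.

```python
# Kyte-Doolittle hydropathy values (higher = more hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def rank_candidate_peptides(protein: str, length: int = 15):
    """Rank every window of the given length, most hydrophilic first.

    Ties in mean hydropathy are broken in favor of windows containing more
    proline residues. Returns a list of (start_index, peptide) tuples, with
    start_index counted from zero. Unknown residue letters raise KeyError.
    """
    if length < 1:
        raise ValueError("length must be at least 1")
    protein = protein.upper()
    scored = []
    for start in range(len(protein) - length + 1):
        peptide = protein[start:start + length]
        mean_hydropathy = sum(KYTE_DOOLITTLE[aa] for aa in peptide) / length
        scored.append((mean_hydropathy, -peptide.count("P"), start, peptide))
    scored.sort()
    return [(start, peptide) for _, _, start, peptide in scored]


if __name__ == "__main__":
    # Made-up sequence, used only to show the call.
    for start, peptide in rank_candidate_peptides("LLVVKDEPRSKDEPGTIIVL", 10)[:3]:
        print(start, peptide)
```

A ranking of this kind only narrows the candidates; whether
a given peptide elicits protein-reactive antibodies still
has to be determined experimentally.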

The present invention also provides isolated
receptor polypeptides that are substantially identical to
the receptor polypeptides of SEQ ID NOs: 2, 4 and 6 and
their species orthologs. By "isolated" is meant a protein
or polypeptide that is found in a condition other than its
native environment, such as apart from blood and animal
tissue. In a preferred form, the isolated polypeptide is
substantially free of other polypeptides, particularly
other polypeptides of animal origin. It is preferred to
provide the polypeptides in a highly purified form, i.e.,
greater than 95% pure, more preferably greater than 99%
pure. The term "substantially homologous" is used herein
to denote polypeptides having 50%, preferably 60%, more
preferably at least 80%, sequence identity to the
sequences shown in SEQ ID NO: 2, 4, or 7 or their species
orthologs. Such polypeptides will more preferably be at
least 90% identical, and most preferably 95% or more
identical to SEQ ID NO: 2, 4 or 6 or their species
orthologs. Percent sequence identity is determined by
conventional methods. See, for example, Altschul et al.,
Bull. Math. Bio. 48:603-616 (1986) and Henikoff and
Henikoff, Proc. Natl. Acad. Sci. USA 89:10915-10919
(1992). Briefly, two amino acid sequences are aligned to
optimize the alignment scores using a gap opening penalty
of 10, a gap extension penalty of 1, and the "blosum 62"
scoring matrix of Henikoff and Henikoff (ibid.) as shown
in Table 2 (amino acids are indicated by the standard
one-letter codes). The percent identity is then calculated
as:

        Total number of identical matches
   -------------------------------------------  x 100
   [length of the longer sequence plus the
    number of gaps introduced into the longer
    sequence in order to align the two
    sequences]

Sequence identity of polynucleotide molecules is
determined by similar methods using a ratio as disclosed
above.
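
The ratio above can be applied directly to a pair of
sequences that have already been aligned. The sketch below
assumes the two aligned sequences are supplied as
equal-length strings with "-" marking gap positions; it does
not perform the alignment itself (the gap penalties and
scoring matrix are used upstream of this step). The function
name and the gap character are illustrative assumptions.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity of two pre-aligned sequences.

    Numerator: number of columns where both sequences carry the same residue.
    Denominator: length of the longer (ungapped) sequence plus the number of
    gaps introduced into that longer sequence by the alignment.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    len_a = len(aligned_a) - aligned_a.count(gap)
    len_b = len(aligned_b) - aligned_b.count(gap)
    longer = aligned_a if len_a >= len_b else aligned_b
    denominator = max(len_a, len_b) + longer.count(gap)
    if denominator == 0:
        raise ValueError("sequences contain no residues")
    return 100.0 * matches / denominator


if __name__ == "__main__":
    # Made-up aligned pair: 5 identical columns over a denominator of 7.
    print(percent_identity("MKT-LLV", "MKTALL-"))
```

The same function can be applied to aligned polynucleotide
sequences, since it only compares characters column by
column.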



Polynucleotides, generally a cDNA sequence, of
the present invention encode the above-described
polypeptides. A cDNA sequence which encodes a polypeptide
of the present invention is comprised of a series of
codons, each amino acid residue of the polypeptide being
encoded by a codon and each codon being comprised of three
nucleotides. The amino acid residues are encoded by their
respective codons as follows.



Alanine (Ala) is encoded by GCA, GCC, GCG or GCT;
Cysteine (Cys) is encoded by TGC or TGT;
Aspartic acid (Asp) is encoded by GAC or GAT;
Glutamic acid (Glu) is encoded by GAA or GAG;
Phenylalanine (Phe) is encoded by TTC or TTT;
Glycine (Gly) is encoded by GGA, GGC, GGG or GGT;
Histidine (His) is encoded by CAC or CAT;
Isoleucine (Ile) is encoded by ATA, ATC or ATT;
Lysine (Lys) is encoded by AAA or AAG;
Leucine (Leu) is encoded by TTA, TTG, CTA, CTC, CTG or CTT;
Methionine (Met) is encoded by ATG;
Asparagine (Asn) is encoded by AAC or AAT;
Proline (Pro) is encoded by CCA, CCC, CCG or CCT;
Glutamine (Gln) is encoded by CAA or CAG;
Arginine (Arg) is encoded by AGA, AGG, CGA, CGC, CGG or CGT;
Serine (Ser) is encoded by AGC, AGT, TCA, TCC, TCG or TCT;
Threonine (Thr) is encoded by ACA, ACC, ACG or ACT;
Valine (Val) is encoded by GTA, GTC, GTG or GTT;
Tryptophan (Trp) is encoded by TGG; and
Tyrosine (Tyr) is encoded by TAC or TAT.

It is to be recognized that according to the
present invention, when a cDNA is claimed as described
above, it is understood that what is claimed are the
sense strand, the anti-sense strand, and the
double-stranded DNA having both the sense and anti-sense
strands annealed together by their respective hydrogen
bonds. Also claimed is the messenger RNA (mRNA) which
encodes the polypeptides of the present invention, and
which mRNA is encoded by the above-described cDNA. A
messenger RNA (mRNA) will encode a polypeptide using the
same codons as those defined above, with the exception
that each thymine nucleotide (T) is replaced by a uracil
nucleotide (U).
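
The codon assignments listed above, together with the T to U
replacement for mRNA, can be captured in a small lookup
table. The sketch below is illustrative only: the dictionary
holds exactly the 61 sense codons enumerated in the text,
while the function names and the choice to stop translating
at the first triplet that is not in the table (for example a
stop codon) are assumptions made here.

```python
# Codons for each amino acid, exactly as enumerated in the text (cDNA alphabet).
CODONS = {
    "Ala": ["GCA", "GCC", "GCG", "GCT"],
    "Cys": ["TGC", "TGT"],
    "Asp": ["GAC", "GAT"],
    "Glu": ["GAA", "GAG"],
    "Phe": ["TTC", "TTT"],
    "Gly": ["GGA", "GGC", "GGG", "GGT"],
    "His": ["CAC", "CAT"],
    "Ile": ["ATA", "ATC", "ATT"],
    "Lys": ["AAA", "AAG"],
    "Leu": ["TTA", "TTG", "CTA", "CTC", "CTG", "CTT"],
    "Met": ["ATG"],
    "Asn": ["AAC", "AAT"],
    "Pro": ["CCA", "CCC", "CCG", "CCT"],
    "Gln": ["CAA", "CAG"],
    "Arg": ["AGA", "AGG", "CGA", "CGC", "CGG", "CGT"],
    "Ser": ["AGC", "AGT", "TCA", "TCC", "TCG", "TCT"],
    "Thr": ["ACA", "ACC", "ACG", "ACT"],
    "Val": ["GTA", "GTC", "GTG", "GTT"],
    "Trp": ["TGG"],
    "Tyr": ["TAC", "TAT"],
}

# Reverse lookup: codon -> three-letter residue name.
CODON_TO_RESIDUE = {
    codon: residue for residue, codons in CODONS.items() for codon in codons
}


def cdna_to_mrna(cdna: str) -> str:
    """Write a cDNA sense strand as mRNA by replacing each T with U."""
    return cdna.upper().replace("T", "U")


def translate(cdna: str) -> list:
    """Translate a cDNA sense strand codon by codon.

    Translation stops at the first triplet that is not in the table.
    """
    cdna = cdna.upper()
    residues = []
    for i in range(0, len(cdna) - len(cdna) % 3, 3):
        codon = cdna[i:i + 3]
        if codon not in CODON_TO_RESIDUE:
            break
        residues.append(CODON_TO_RESIDUE[codon])
    return residues


if __name__ == "__main__":
    print(cdna_to_mrna("ATGGCCTGG"))
    print(translate("ATGGCCTGGTAA"))
```

Because most residues have several codons, many different
cDNA sequences encode the same polypeptide, which is why the
text enumerates every alternative.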


Substantially identical proteins and
polypeptides are characterized as having one or more amino
acid substitutions, deletions or additions. These changes
are preferably of a minor nature, that is, conservative
amino acid substitutions (see Table 3) and other
substitutions that do not significantly affect the folding
or activity of the protein or polypeptide; small
deletions, typically of one to about 30 amino acids; and
small amino- or carboxyl-terminal extensions, such as an
amino-terminal methionine residue, a small linker peptide
of up to about 20-25 residues, or a small extension that
facilitates purification (an affinity tag), such as a
poly-histidine tract, protein A, Nilsson et al., EMBO J.
4:1075 (1985); Nilsson et al., Methods Enzymol. 198:3
(1991), glutathione S transferase, Smith and Johnson,
Gene 67:31 (1988), or other antigenic epitope or binding
domain. See, in general, Ford et al., Protein Expression
and Purification 2:95-107 (1991). DNAs encoding affinity
tags are available from commercial suppliers (e.g.,
Pharmacia Biotech, Piscataway, NJ).



Table 3

Conservative amino acid substitutions

Basic:          arginine
                lysine
                histidine

Acidic:         glutamic acid
                aspartic acid

Polar:          glutamine
                asparagine

Hydrophobic:    leucine
                isoleucine
                valine

Aromatic:       phenylalanine
                tryptophan
                tyrosine

Small:          glycine
                alanine
                serine
                threonine
                methionine
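
The groupings in Table 3 lend themselves to a simple lookup.
The sketch below encodes the six groups with one-letter
residue codes and reports whether a substitution stays
within a single group. The function name and the decision to
treat an unchanged residue as "not a substitution" are
assumptions made for illustration.

```python
# Groups from Table 3, written with standard one-letter amino acid codes.
CONSERVATIVE_GROUPS = {
    "basic": {"R", "K", "H"},
    "acidic": {"E", "D"},
    "polar": {"Q", "N"},
    "hydrophobic": {"L", "I", "V"},
    "aromatic": {"F", "W", "Y"},
    "small": {"G", "A", "S", "T", "M"},
}


def is_conservative(original: str, replacement: str) -> bool:
    """True if both residues fall in the same Table 3 group.

    An identical residue is not counted as a substitution and returns False.
    """
    original, replacement = original.upper(), replacement.upper()
    if original == replacement:
        return False
    return any(
        original in group and replacement in group
        for group in CONSERVATIVE_GROUPS.values()
    )


if __name__ == "__main__":
    print(is_conservative("K", "R"))  # both basic
    print(is_conservative("L", "D"))  # hydrophobic vs. acidic
```

Residues that do not appear in Table 3 (for example cysteine
and proline) have no conservative partner under this lookup.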



Essential amino acids in the receptor
polypeptides of the present invention can be identified
according to procedures known in the art, such as
site-directed mutagenesis or alanine-scanning mutagenesis,
Cunningham and Wells, Science 244:1081-1085 (1989); Bass
et al., Proc. Natl. Acad. Sci. USA 88:4498-4502 (1991).
In the latter technique, single alanine mutations are
introduced at every residue in the molecule, and the
resultant mutant molecules are tested for biological
activity (e.g., ligand binding and signal transduction) to
identify amino acid residues that are critical to the
activity of the molecule. Sites of ligand-receptor
interaction can also be determined by analysis of crystal
structure as determined by such techniques as nuclear
magnetic resonance, crystallography or photoaffinity
labeling. See, for example, de Vos et al., Science
255:306-312 (1992); Smith et al., J. Mol. Biol. 224:899-
904 (1992); Wlodawer et al., FEBS Lett. 309:59-64 (1992).
The identities of essential amino acids can also be
inferred from analysis of homologies with related
receptors.
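
The alanine-scanning design described above amounts to
enumerating one mutant per position. The following sketch
generates that set of sequences in silico. The function
name, the residue-position-Ala naming scheme (for example
"K3A") and the choice to skip positions that are already
alanine are assumptions made here; the activity testing of
each mutant is an experimental step that is not modeled.

```python
def alanine_scan(protein: str):
    """Enumerate single-alanine mutants of a protein sequence.

    Returns a list of (name, mutant_sequence) tuples, one per position whose
    residue is not already alanine. Positions in names are counted from 1.
    """
    protein = protein.upper()
    mutants = []
    for index, residue in enumerate(protein):
        if residue == "A":
            continue  # substituting Ala for Ala would not change the sequence
        name = f"{residue}{index + 1}A"
        mutant = protein[:index] + "A" + protein[index + 1:]
        mutants.append((name, mutant))
    return mutants


if __name__ == "__main__":
    # Made-up pentapeptide, used only to show the output format.
    for name, seq in alanine_scan("MKALV"):
        print(name, seq)
```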

Multiple amino acid substitutions can be made
and tested using known methods of mutagenesis and
screening, such as those disclosed by Reidhaar-Olson and
Sauer, Science 241:53-57 (1988) or Bowie and Sauer, Proc.
Natl. Acad. Sci. USA 86:2152-2156 (1989). Briefly, these
authors disclose methods for simultaneously randomizing
two or more positions in a polypeptide, selecting for
functional polypeptide, and then sequencing the
mutagenized polypeptides to determine the spectrum of
allowable substitutions at each position. Other methods
that can be used include phage display (e.g., Lowman et
al., Biochem. 30:10832-10837 (1991); Ladner et al., U.S.
Patent No. 5,223,409; Huse, WIPO Publication WO 92/06204)
and region-directed mutagenesis, Derbyshire et al., Gene
46:145 (1986); Ner et al., DNA 7:127 (1988).

Mutagenesis methods as disclosed above can be
combined with high-throughput screening methods to detect
activity of cloned, mutagenized receptors in host cells.
Preferred assays in this regard include cell proliferation
assays and biosensor-based ligand-binding assays, which
are described below. Mutagenized DNA molecules that encode
active receptors or portions thereof (e.g., ligand-binding
fragments) can be recovered from the host cells and
rapidly sequenced using modern equipment. These methods
allow the rapid determination of the importance of
individual amino acid residues in a polypeptide of
interest, and can be applied to polypeptides of unknown
structure.

Using the methods discussed above, one of
ordinary skill in the art can prepare a variety of
polypeptides that are substantially homologous to
SEQ ID NOs: 2, 4, 6, 17, 18, or 19 or allelic variants
thereof and retain the ligand-binding properties of the
wild-type receptor.

The receptor polypeptides of the present
invention, including full-length receptors, receptor
fragments (e.g., ligand-binding fragments), and fusion
polypeptides can be produced in genetically engineered
host cells according to conventional techniques. Suitable
host cells are those cell types that can be transformed or
transfected with exogenous DNA and grown in culture, and
include bacteria, fungal cells, and cultured higher
eukaryotic cells. Eukaryotic cells, particularly cultured
cells of multicellular organisms, are preferred.
Techniques for manipulating cloned DNA molecules and
introducing exogenous DNA into a variety of host cells are
disclosed by Sambrook et al., Molecular Cloning: A
Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory
Press, Cold Spring Harbor, NY, 1989, and Ausubel et al.,
ibid.

In general, a DNA sequence encoding a Zcytor5
receptor polypeptide is operably linked to other genetic
elements required for its expression, generally including
a transcription promoter and terminator, within an
expression vector. The vector will also commonly contain
one or more selectable markers and one or more origins of
replication, although those skilled in the art will
recognize that within certain systems selectable markers
may be provided on separate vectors, and replication of
the exogenous DNA may be provided by integration into the
host cell genome. Selection of promoters, terminators,
selectable markers, vectors and other elements is a matter
of routine design within the level of ordinary skill in
the art. Many such elements are described in the
literature and are available through commercial suppliers.

To direct a Zcytor5 receptor polypeptide into
the secretory pathway of a host cell, a secretory signal
sequence (also known as a leader sequence, prepro sequence
or pre sequence) is provided in the expression vector.
The secretory signal sequence may be that of the receptor,
or may be derived from another secreted protein (e.g.,
t-PA) or synthesized de novo. The secretory signal sequence
is joined to the Zcytor5 DNA sequence in the correct
reading frame. Secretory signal sequences are commonly
positioned 5' to the DNA sequence encoding the polypeptide
of interest, although certain signal sequences may be
positioned elsewhere in the DNA sequence of interest (see,
e.g., Welch et al., U.S. Patent No. 5,037,743; Holland et
al., U.S. Patent No. 5,143,830).



Cultured mammalian cells are preferred hosts
within the present invention. Methods for introducing
exogenous DNA into mammalian host cells include calcium
phosphate-mediated transfection, Wigler et al., Cell
14:725 (1978); Corsaro and Pearson, Somatic Cell Genetics
7:603 (1981), Graham and Van der Eb, Virology 52:456
(1973), electroporation, Neumann et al., EMBO J. 1:841-845
(1982), DEAE-dextran mediated transfection, Ausubel et
al., eds., Current Protocols in Molecular Biology (John
Wiley and Sons, Inc., NY, 1987), and liposome-mediated
transfection, Hawley-Nelson et al., Focus 15:73 (1993);
Ciccarone et al., Focus 15:80 (1993). The production of
recombinant polypeptides in cultured mammalian cells is
disclosed, for example, by Levinson et al., U.S. Patent
No. 4,713,339; Hagen et al., U.S. Patent No. 4,784,950;
Palmiter et al., U.S. Patent No. 4,579,821; and Ringold,
U.S. Patent No. 4,656,134. Suitable cultured mammalian
cells include the COS-1 (ATCC No. CRL 1650), COS-7 (ATCC
No. CRL 1651), BHK (ATCC No. CRL 1632), BHK 570 (ATCC No.
CRL 10314), 293 (ATCC No. CRL 1573; Graham et al., J. Gen.
Virol. 36:59-72 (1977)) and Chinese hamster ovary (e.g.,
CHO-K1; ATCC No. CCL 61) cell lines. Additional suitable
cell lines are known in the art and available from public
depositories such as the American Type Culture Collection,
Rockville, Maryland. In general, strong transcription
promoters are preferred, such as promoters from SV-40 or
cytomegalovirus. See, e.g., U.S. Patent No. 4,956,288.
Other suitable promoters include those from
metallothionein genes (U.S. Patent Nos. 4,579,821 and
4,601,978) and the adenovirus major late promoter.

Drug selection is generally used to select for
cultured mammalian cells into which foreign DNA has been
inserted. Such cells are commonly referred to as
"transfectants". Cells that have been cultured in the
presence of the selective agent and are able to pass the
gene of interest to their progeny are referred to as
"stable transfectants." A preferred selectable marker is
a gene encoding resistance to the antibiotic neomycin.
Selection is carried out in the presence of a
neomycin-type drug, such as G-418 or the like. Selection
systems may also be used to increase the expression level
of the gene of interest, a process referred to as
"amplification." Amplification is carried out by
culturing transfectants in the presence of a low level of
the selective agent and then increasing the amount of
selective agent to select for cells that produce high
levels of the products of the introduced genes. A
preferred amplifiable selectable marker is dihydrofolate
reductase, which confers resistance to methotrexate.
Other drug resistance genes (e.g., hygromycin resistance,
multi-drug resistance, puromycin acetyltransferase) can
also be used.

Other higher eukaryotic cells can also be used
as hosts, including insect cells, plant cells and avian
cells. Transformation of insect cells and production of
foreign polypeptides therein is disclosed by Guarino et
al., U.S. Patent No. 5,162,222; Bang et al., U.S. Patent
No. 4,775,624; and WIPO publication WO 94/06463, which are
incorporated herein by reference. The use of
Agrobacterium rhizogenes as a vector for expressing genes
in plant cells has been reviewed by Sinkar et al., J.
Biosci. (Bangalore) 11:47-58 (1987).

Fungal cells, including yeast cells, and
particularly cells of the genus Saccharomyces, can also be
used within the present invention, such as for producing
receptor fragments or polypeptide fusions. Methods for
transforming yeast cells with exogenous DNA and producing
recombinant polypeptides therefrom are disclosed by, for
example, Kawasaki, U.S. Patent No. 4,599,311; Kawasaki et
al., U.S. Patent No. 4,931,373; Brake, U.S. Patent No.
4,870,008; Welch et al., U.S. Patent No. 5,037,743; and
Murray et al., U.S. Patent No. 4,845. Transformed cells
are selected by phenotype determined by the selectable
marker, commonly drug resistance or the ability to grow in
the absence of a particular nutrient (e.g., leucine). A
preferred vector system for use in yeast is the POT1
vector system disclosed by Kawasaki et al. (U.S. Patent
No. 4,931,373), which allows transformed cells to be
selected by growth in glucose-containing media. Suitable
promoters and terminators for use in yeast include those
from glycolytic enzyme genes (see, e.g., Kawasaki, U.S.
Patent No. 4,599,311; Kingsman et al., U.S. Patent No.
4,615,974; and Bitter, U.S. Patent No. 4,977,092) and
alcohol dehydrogenase genes. See also U.S. Patents Nos.
4,990,446; 5,063,154; 5,139,936 and 4,661,454.
Transformation systems for other yeasts, including
Hansenula polymorpha, Schizosaccharomyces pombe,
Kluyveromyces lactis, Kluyveromyces fragilis, Ustilago
maydis, Pichia pastoris, Pichia methanolica, Pichia
guillermondii and Candida maltosa are known in the art.
See, for example, Gleeson et al., J. Gen. Microbiol.
132:3459-3465 (1986) and Cregg, U.S. Patent No. 4,882,279.
Aspergillus cells may be utilized according to the methods
of McKnight et al., U.S. Patent No. 4,935,349. Methods
for transforming Acremonium chrysogenum are disclosed by
Sumino et al., U.S. Patent No. 5,162,228. Methods for
transforming Neurospora are disclosed by Lambowitz, U.S.
Patent No. 4,486,533.

Transformed or transfected host cells are
cultured according to conventional procedures in a culture
medium containing nutrients and other components required
for the growth of the chosen host cells. A variety of
suitable media, including defined media and complex media,
are known in the art and generally include a carbon
source, a nitrogen source, essential amino acids, vitamins
and minerals. Media may also contain such components as
growth factors or serum, as required. The growth medium
will generally select for cells containing the exogenously
added DNA by, for example, drug selection or deficiency in
an essential nutrient which is complemented by the
selectable marker carried on the expression vector or
co-transfected into the host cell.



Within one aspect of the present invention, a
novel receptor is produced by a cultured cell, and the
cell is used to screen for ligands for the receptor,
including the natural ligand, as well as agonists and
antagonists of the natural ligand. To summarize this
approach, a cDNA or gene encoding the receptor is combined
with other genetic elements required for its expression
(e.g., a transcription promoter), and the resulting
expression vector is inserted into a host cell. Cells
that express the DNA and produce functional receptor are
selected and used within a variety of screening systems.

Mammalian cells suitable for use in expressing
Zcytor5 receptors and transducing a receptor-mediated
signal include cells that express a β-subunit, such as the
human βc subunit. In this regard it is generally
preferred to employ a cell that is responsive to other
cytokines that bind to receptors in the same subfamily,
such as IL-3 or GM-CSF, because such cells will contain
the requisite signal transduction pathway(s). It is also
preferred to use a cell from the same species as the
receptor to be expressed. Within a preferred embodiment,
the cell is dependent upon an exogenously supplied
hematopoietic growth factor for its proliferation.
Preferred cell lines of this type are the human TF-1 cell
line (ATCC number CRL-2003) and the AML-193 cell line
(ATCC number CRL-9589), which are GM-CSF-dependent human
leukemic cell lines. In the alternative, suitable host
cells can be engineered to produce a β-subunit (e.g., βc)
or other cellular component needed for the desired
cellular response. For example, the murine cell line
BaF3, Palacios and Steinmetz, Cell 41:727-734 (1985);
Mathey-Prevot et al., Mol. Cell. Biol. 6:4133-4135 (1986),
or a baby hamster kidney (BHK) cell line can be
transfected to express the human βc subunit (also known as
KH97) as well as a Zcytor5 receptor. The latter approach
is advantageous because cell lines can be engineered to
express receptor subunits from any species, thereby
overcoming potential limitations arising from species
specificity. In the alternative, species orthologs of the
human receptor cDNA can be cloned and used within cell
lines from the same species, such as a mouse cDNA in the
BaF3 cell line. Cell lines that are dependent upon one
hematopoietic growth factor, such as GM-CSF, can thus be
engineered to become dependent upon a Zcytor5 ligand.

Cells expressing functional receptor are used
within screening assays. A variety of suitable assays are
known in the art. These assays are based on the detection
of a biological response in a target cell. One such assay
is a cell proliferation assay. Cells are cultured in the
presence or absence of a test compound, and cell
proliferation is detected by, for example, measuring
incorporation of tritiated thymidine or by colorimetric
assay based on the metabolic breakdown of
3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyl tetrazolium
bromide (MTT) (Mosman, J. Immunol. Meth. 65:55-63 (1983)).
An alternative assay format uses cells that are further
engineered to express a reporter gene. The reporter gene
is linked to a promoter element that is responsive to the
receptor-linked pathway, and the assay detects activation
of transcription of the reporter gene. A preferred
promoter element in this regard is a serum response
element, or SRE, e.g., Shaw et al., Cell 56:563-572
(1989). A preferred reporter gene is a luciferase gene,
de Wet et al., Mol. Cell. Biol. 7:725 (1987). Expression
of the luciferase gene is detected by luminescence using
methods known in the art (e.g., Baumgartner et al., J.
Biol. Chem. 269:29094-29101 (1994); Schenborn and Goiffin,
Promega Notes 41:11 (1993)). Luciferase activity assay
kits are commercially available from, for example, Promega
Corp., Madison, WI. Target cell lines of this type can be
used to screen libraries of chemicals, cell-conditioned
culture media, fungal broths, soil samples, water samples,
and the like. For example, a bank of cell-conditioned
media samples can be assayed on a target cell to identify
cells that produce ligand. Positive cells are then used
to produce a cDNA library in a mammalian expression
vector, which is divided into pools, transfected into host
cells, and expressed. Media samples from the transfected
cells are then assayed, with subsequent division of pools,
re-transfection, subculturing, and re-assay of positive
cells to isolate a cloned cDNA encoding the ligand.

A natural ligand for the Zcytor5 receptor can
also be identified by mutagenizing a cell line expressing
the receptor and culturing it under conditions that select
for autocrine growth. See WIPO publication WO 95/21930.
Within a typical procedure, BaF3 cells expressing Zcytor5
and human βc are mutagenized, such as with
2-ethylmethanesulfonate (EMS). The cells are then allowed
to recover in the presence of IL-3, then transferred to a
culture medium lacking IL-3 and IL-4. Surviving cells are
screened for the production of a Zcytor5 ligand, such as
by adding soluble receptor to the culture medium or by
assaying conditioned media on wild-type BaF3 cells and
BaF3 cells expressing the receptor.

An additional screening approach provided by the
present invention includes the use of hybrid receptor
polypeptides. These hybrid polypeptides fall into two
general classes. It is preferred that the second receptor
be a hematopoietic cytokine receptor, such as mpl receptor
(Souyri et al., Cell 63:1137-1147 (1990)). The hybrid
receptor will further comprise a transmembrane domain,
which may be derived from either receptor. A DNA
construct encoding the hybrid receptor is then inserted
into a host cell. Cells expressing the hybrid receptor
are cultured in the presence of a ligand for the binding
domain and assayed for a response. This system provides a
means for analyzing signal transduction mediated by
Zcytor5 while using readily available ligands. This
system can also be used to determine if particular cell
lines are capable of responding to signals transduced by
Zcytor5.

Cells found to express the ligand are then used
to prepare a cDNA library from which the ligand-encoding
cDNA can be isolated as disclosed above. The present
invention thus provides, in addition to novel receptor
polypeptides, methods for cloning polypeptide ligands for
the receptors.

Compounds identified as receptor agonists are
useful for stimulating proliferation and development of
target cells in vitro and in vivo. For example, agonist
compounds are useful as components of defined cell culture
media, and may be used alone or in combination with other
cytokines and hormones to replace serum that is commonly
used in cell culture. Agonists are thus useful in
specifically promoting the growth and/or development of
testis-derived cells in culture. Agonists and antagonists
may also prove useful in the study of spermatogenesis and
infertility. Antagonists are useful as research reagents
for characterizing sites of ligand-receptor interaction.
In vivo, receptor agonists may find application in the
treatment of male infertility. Antagonists of receptor
function may be useful as male contraceptive agents.

The proposed cytokine binding domain of Zcytor5
appears to be closest to the Interleukin-6 β-chain or
gp130 (29% identity). The ligand for Zcytor5 is probably
a member of the Interleukin-6 family of cytokines, which at
present includes Interleukin-6, Interleukin-11, Leukemia
Inhibitory Factor, Oncostatin M, Cardiotrophin-1 and
Ciliary Neurotrophic Factor.

None of the Zcytor5 cDNAs isolated thus far encodes
a transmembrane domain or any recognizable
cytoplasmic signaling motifs characteristic of the Class I
receptors. Structurally, Zcytor5 bears close similarity to
the α-subunit of the Ciliary Neurotrophic Factor receptor
(CNTF-Rα). It is quite possible that Zcytor5 does not have
a transmembrane domain form and that the native molecule is
phosphatidyl-inositol linked to the cell membrane in a
manner similar to CNTF-Rα.

Robledo et al. (J. Biol. Chem., 272: 4855-4863)
provide evidence for the existence of a third component of
the Cardiotrophin-1 receptor (CT-1R). CT-1R is believed to
have a tripartite structure comprised of gp130, gp190 (LIF
receptor β) and an uncharacterized 45 kDa subunit (CT-1Rα)
that appears to be linked to the cell surface through a
phosphatidylinositol linkage. CT-1Rα appears to be
important for increased sensitivity and specificity of the
receptor complex to Cardiotrophin-1. The data suggest that
Zcytor5 is CT-1Rα. Cardiotrophin-1 is a member of the
Interleukin-6 family. Another receptor of this family in
which gp130 and gp190 are members of a tripartite complex
is the Ciliary Neurotrophic Factor receptor. In this
receptor complex, CNTF-Rα comprises the third receptor
subunit, and it mediates specificity and high-affinity
binding of the ligand complex. These functions are similar
to those proposed for CT-1Rα. One might then argue on the
basis of "symmetry of nature" that CT-1Rα would physically
resemble CNTF-Rα and that the close structural similarity
of Zcytor5 to CNTF-Rα would make Zcytor5 a possible
candidate for the third subunit of CT-1R. Furthermore, the
proposed 45 kDa molecular mass of CT-1Rα agrees with that
of Zcytor5, and the transcripts of CT-1 and Zcytor5 are
found in similar tissues. In particular, both transcripts
are found at high levels in heart and in skeletal muscle,
which is consistent with the observation that ligands and
their receptor subunits are often co-expressed in the same
tissue.

Cardiotrophin-1 was originally cloned by function
as a factor involved in cardiac hypertrophy, an adaptive
response of heart muscle to an increased work load.
Hypertrophy is characterized by reactivation of genes
expressed during fetal heart development and by the
accumulation of sarcomeric proteins. If Zcytor5 proves to
be the subunit that is important in the binding and
specificity of Cardiotrophin-1 to its receptor, Zcytor5 may
prove to be a useful therapeutic antagonist to counteract
the hypertrophic response to injury. Cardiotrophin-1 has
also been shown to promote survival of rat dopaminergic
neurons in vitro. An agonist-active soluble receptor may
potentially be useful in the treatment of neuronal
disorders such as Parkinson's disease.

Zcytor5 may also be used within diagnostic
systems for the detection of circulating levels of ligand.
Within a related embodiment, antibodies or other agents
that specifically bind to Zcytor5 can be used to detect
circulating receptor polypeptides. Elevated or depressed
levels of ligand or receptor polypeptides may be
indicative of pathological conditions, including cancer.

Zcytor5 receptor polypeptides can be prepared by
expressing a DNA encoding a Zcytor5 polypeptide as
described in SEQ ID NO: 1, 3 and 5. To direct the export of
the receptor domain from the host cell, the receptor DNA
is linked to a second DNA segment encoding a secretory
peptide. It is believed that amino acid residues 1-34 or,
in the alternative, amino acid residues 1-30 of SEQ ID
NO: 2 are secretory peptides. For SEQ ID NO: 4, it is
believed that residues 1-33 or, in the alternative, 1-29 are
secretory peptides. For the rat sequence, it is believed
that amino acid residues 1-40 define a secretory peptide.
These peptides are generally cleaved after secretion by a
mammalian cell. In the alternative, other secretory
peptides could be fused to the Zcytor5 polypeptide, such
as the t-PA secretory peptide. To facilitate purification
of the secreted receptor domain, a C-terminal extension,
such as a poly-histidine tag, substance P, Flag™ peptide
(Hopp et al., Biotechnology 6:1204-1210 (1988); available
from Eastman Kodak Co., New Haven, CT) or another
polypeptide or protein for which an antibody or other
specific binding agent is available, can be fused to the
receptor polypeptide.

In an alternative approach, a receptor
extracellular domain can be expressed as a fusion with
immunoglobulin heavy chain constant regions, typically an
Fc fragment, which contains two constant region domains
and a hinge region but lacks the variable region. Such
fusions are typically secreted as multimeric molecules
wherein the Fc portions are disulfide bonded to each other
and two receptor polypeptides are arrayed in close
proximity to each other. Fusions of this type can be used
to affinity purify the cognate ligand from solution, as an
in vitro assay tool, to block signals in vitro by
specifically titrating out ligand, and as antagonists in
vivo by administering them parenterally to bind
circulating ligand and clear it from the circulation. To
purify ligand, a Zcytor5-Ig chimera is added to a sample
containing the ligand (e.g., cell-conditioned culture
media or tissue extracts) under conditions that facilitate
receptor-ligand binding (typically near-physiological
temperature, pH, and ionic strength). The chimera-ligand
complex is then separated from the mixture using protein A,
which is immobilized on a solid support (e.g., insoluble
resin beads). The ligand is then eluted using
conventional chemical techniques, such as with a salt or
pH gradient. In the alternative, the chimera itself can
be bound to a solid support, with binding and elution
carried out as above. The chimeras may be used in vivo to
induce infertility. Chimeras with high binding affinity
are administered parenterally (e.g., by intramuscular,
subcutaneous or intravenous injection). Circulating
molecules bind ligand and are cleared from circulation by
normal physiological processes. For use in assays, the
chimeras are bound to a support via the Fc region and used
in an ELISA format.

A preferred assay system employing a ligand-binding
receptor fragment uses a commercially available
biosensor instrument (BIAcore™, Pharmacia Biosensor,
Piscataway, NJ), wherein the receptor fragment is
immobilized onto the surface of a receptor chip. Use of
this instrument is disclosed by Karlsson, J. Immunol.
Methods 145:229-240 (1991) and Cunningham and Wells, J.
Mol. Biol. 234:554-563 (1993). A receptor fragment is
covalently attached, using amine or sulfhydryl chemistry,
to dextran fibers that are attached to gold film within
the flow cell. A test sample is passed through the cell.
If ligand is present in the sample, it will bind to the
immobilized receptor polypeptide, causing a change in the
refractive index of the medium, which is detected as a
change in surface plasmon resonance of the gold film.
This system allows the determination of on- and off-rates,
from which binding affinity can be calculated, and
assessment of stoichiometry of binding.
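
The relationship between on- and off-rates and binding affinity
mentioned above can be sketched in a few lines of Python. This is
a minimal illustration that assumes simple 1:1 binding; the
function name and the rate constants are hypothetical and are not
measured values for Zcytor5.

```python
def equilibrium_constants(k_on, k_off):
    """Return (K_D in M, K_A in 1/M) for simple 1:1 binding.

    k_on  -- association rate constant, in 1/(M*s)
    k_off -- dissociation rate constant, in 1/s
    """
    if k_on <= 0 or k_off <= 0:
        raise ValueError("rate constants must be positive")
    k_d = k_off / k_on      # equilibrium dissociation constant
    k_a = 1.0 / k_d         # equilibrium association constant
    return k_d, k_a


# Hypothetical rate constants, chosen only to illustrate the arithmetic
k_d, k_a = equilibrium_constants(k_on=1.0e5, k_off=1.0e-3)
print(f"K_D = {k_d:.1e} M, K_A = {k_a:.1e} /M")
```

A slower off-rate or a faster on-rate both lower K_D, which is why
the two rates are reported separately as well as in the ratio.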

Ligand-binding receptor polypeptides can also be
used within other assay systems known in the art. Such
systems include Scatchard analysis for determination of
binding affinity (Scatchard, Ann. NY Acad. Sci. 51:
660-672 (1949)) and calorimetric assays (Cunningham et al.,
Science 253:545-548 (1991); Cunningham et al., Science
254:821-825 (1991)).
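
As an illustration of Scatchard analysis, the following Python
sketch fits a straight line to bound/free versus bound and reads
the dissociation constant from the slope (slope = -1/K_d) and the
number of binding sites from the x-intercept (B_max). It assumes a
single class of non-interacting sites; the data are synthetic and
noise-free, generated from hypothetical values, and are not
experimental results.

```python
def scatchard_fit(bound, free):
    """Least-squares line through (bound, bound/free).

    Returns (K_d, B_max) in the same concentration units as the inputs.
    """
    xs = list(bound)
    ys = [b / f for b, f in zip(bound, free)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    k_d = -1.0 / slope            # slope of a Scatchard plot is -1/K_d
    b_max = -intercept / slope    # x-intercept is B_max
    return k_d, b_max


# Synthetic one-site binding data (hypothetical values, molar units)
TRUE_KD, TRUE_BMAX = 2.0e-9, 1.0e-10
free = [0.5e-9, 1.0e-9, 2.0e-9, 4.0e-9, 8.0e-9]
bound = [TRUE_BMAX * f / (TRUE_KD + f) for f in free]

k_d, b_max = scatchard_fit(bound, free)
print(f"K_d = {k_d:.2e} M, B_max = {b_max:.2e} M")
```

With real data the points scatter around the line, and curvature in
the plot usually indicates more than one class of binding site.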

A receptor ligand-binding polypeptide can also
be used for purification of ligand. The receptor
polypeptide is immobilized on a solid support, such as
beads of agarose, cross-linked agarose, glass, cellulosic
resins, silica-based resins, polystyrene, cross-linked
polyacrylamide, or like materials that are stable under
the conditions of use. Methods for linking polypeptides
to solid supports are known in the art, and include amine
chemistry, cyanogen bromide activation,
N-hydroxysuccinimide activation, epoxide activation,
sulfhydryl activation, and hydrazide activation. The
resulting media will generally be configured in the form
of a column, and fluids containing ligand are passed
through the column one or more times to allow ligand to
bind to the receptor polypeptide. The ligand is then
eluted using changes in salt concentration or pH to
disrupt ligand-receptor binding.

Zcytor5 polypeptides can also be used to prepare
antibodies that specifically bind to Zcytor5 polypeptides.
These antibodies can then be used to manufacture
anti-idiotype antibodies. As used herein, the term
"antibodies" includes polyclonal antibodies, monoclonal
antibodies, antigen-binding fragments thereof such as
F(ab')2 and Fab fragments, and the like, including
genetically engineered antibodies. Antibodies are defined
to be specifically binding if they bind to a Zcytor5
polypeptide with a Ka of greater than or equal to 10^7/M.
The affinity of a monoclonal antibody can be readily
determined by one of ordinary skill in the art (see, for
example, Scatchard, ibid.).
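
To give a sense of scale for the Ka threshold stated above, the
Python sketch below computes the fraction of binding sites
occupied at equilibrium for a given Ka and free antigen
concentration, assuming simple 1:1 binding. The function names and
the example concentration are illustrative assumptions; only the
10^7/M threshold comes from the text.

```python
SPECIFIC_BINDING_KA = 1.0e7   # 1/M, threshold stated in the text


def is_specifically_binding(k_a):
    """True if the association constant meets the stated threshold."""
    return k_a >= SPECIFIC_BINDING_KA


def fraction_bound(k_a, free_antigen_molar):
    """Equilibrium occupancy for 1:1 binding: Ka*[L] / (1 + Ka*[L])."""
    x = k_a * free_antigen_molar
    return x / (1.0 + x)


# At the threshold Ka, half of the sites are occupied when the free
# antigen concentration equals 1/Ka, i.e. 100 nM.
print(is_specifically_binding(1.0e7))
print(round(fraction_bound(1.0e7, 1.0e-7), 3))
```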

Methods for preparing polyclonal and monoclonal
antibodies are well known in the art (see, for example,
Sambrook et al., Molecular Cloning: A Laboratory Manual,
(Second Edition) (Cold Spring Harbor, NY, 1989); and
Hurrell, J. G. R., Ed., Monoclonal Hybridoma Antibodies:
Techniques and Applications (CRC Press, Inc., Boca Raton,
FL, 1982)). As would be evident to one of ordinary skill
in the art, polyclonal antibodies can be generated from a
variety of warm-blooded animals such as horses, cows,
goats, sheep, dogs, chickens, rabbits, mice, and rats.
The immunogenicity of a Zcytor5 polypeptide may be
increased through the use of an adjuvant such as Freund's
complete or incomplete adjuvant. A variety of assays
known to those skilled in the art can be utilized to
detect antibodies which specifically bind to Zcytor5
polypeptides. Exemplary assays are described in detail in
Antibodies: A Laboratory Manual, Harlow and Lane (Eds.),
(Cold Spring Harbor Laboratory Press, 1988).
Representative examples of such assays include: concurrent
immunoelectrophoresis, radio-immunoassays,
radio-immunoprecipitations, enzyme-linked immunosorbent assays
(ELISA), dot blot assays, inhibition or competition
assays, and sandwich assays.

As would be evident to one of ordinary skill in
the art, polyclonal antibodies can be generated by
inoculating a variety of warm-blooded animals such as
horses, cows, goats, sheep, dogs, chickens, rabbits, mice,
hamsters, guinea pigs and rats with a Zcytor5 polypeptide
or a fragment thereof. The immunogenicity of a Zcytor5
polypeptide may be increased through the use of an
adjuvant, such as alum (aluminum hydroxide) or Freund's
complete or incomplete adjuvant. Polypeptides useful for
immunization also include fusion polypeptides, such as
fusions of Zcytor5 or a portion thereof with an
immunoglobulin polypeptide or with maltose binding
protein. The polypeptide immunogen may be a full-length
molecule or a portion thereof. If the polypeptide portion
is "hapten-like", such portion may be advantageously
joined or linked to a macromolecular carrier (such as
keyhole limpet hemocyanin (KLH), bovine serum albumin
(BSA) or tetanus toxoid) for immunization.

As used herein, the term "antibodies" includes
polyclonal antibodies, affinity-purified polyclonal
antibodies, monoclonal antibodies, and antigen-binding
fragments, such as F(ab')2 and Fab proteolytic fragments.
Genetically engineered intact antibodies or fragments,
such as chimeric antibodies, Fv fragments, single chain
antibodies and the like, as well as synthetic
antigen-binding peptides and polypeptides, are also included.
Non-human antibodies may be humanized by grafting
non-human CDRs onto human framework and constant regions, or
by incorporating the entire non-human variable domains
(optionally "cloaking" them with a human-like surface by
replacement of exposed residues, wherein the result is a
"veneered" antibody). In some instances, humanized
antibodies may retain non-human residues within the human
variable region framework domains to enhance proper
binding characteristics. Through humanizing antibodies,
biological half-life may be increased, and the potential
for adverse immune reactions upon administration to humans
is reduced.

Alternative techniques for generating or
selecting antibodies useful herein include in vitro
exposure of lymphocytes to Zcytor5 protein or peptide, and
selection of antibody display libraries in phage or
similar vectors (for instance, through use of immobilized
or labeled Zcytor5 protein or peptide). Genes encoding
polypeptides having potential Zcytor5 polypeptide binding
domains can be obtained by screening random peptide
libraries displayed on phage (phage display) or on
bacteria, such as E. coli. Nucleotide sequences encoding
the polypeptides can be obtained in a number of ways, such
as through random mutagenesis and random polynucleotide
synthesis. These random peptide display libraries can be
used to screen for peptides which interact with a known
target, which can be a protein or polypeptide, such as a
ligand or receptor, a biological or synthetic
macromolecule, or organic or inorganic substances.
Techniques for creating and screening such random peptide
display libraries are known in the art (Ladner et al., US
Patent No. 5,223,409; Ladner et al., US Patent No.
4,946,778; Ladner et al., US Patent No. 5,403,484 and
Ladner et al., US Patent No. 5,571,698), and random peptide
display libraries and kits for screening such libraries
are available commercially, for instance from Clontech
(Palo Alto, CA), Invitrogen Inc. (San Diego, CA), New
England Biolabs, Inc. (Beverly, MA) and Pharmacia LKB
Biotechnology Inc. (Piscataway, NJ). Random peptide
display libraries can be screened using the Zcytor5
sequences disclosed herein to identify proteins which bind
to Zcytor5. These "binding proteins" which interact with
Zcytor5 polypeptides can be used for tagging cells and for
isolating homolog polypeptides by affinity purification;
they can also be directly or indirectly conjugated to drugs,
toxins, radionuclides and the like. These binding
proteins can also be used in analytical methods, such as
for screening expression libraries and neutralizing
activity. The binding proteins can also be used in
diagnostic assays for determining circulating levels of
polypeptides, and for detecting or quantitating soluble
polypeptides as markers of underlying pathology or disease.
These binding proteins can also act as Zcytor5
"antagonists" to block Zcytor5 binding and signal
transduction in vitro and in vivo.

Antibodies can also be generated by gene therapy.
The animal is administered the DNA or RNA which encodes
Zcytor5 or an immunogenic fragment thereof so that cells
of the animal are transfected with the nucleic acid and
express the protein, which in turn elicits an immune
response. Antibodies which are then produced by the animal
are isolated in the form of polyclonal or monoclonal
antibodies.



Antibodies to Zcytor5 may be used for tagging
cells that express the protein, for affinity purification,
within diagnostic assays for determining circulating
levels of soluble protein polypeptides, and as antagonists
to block ligand binding and signal transduction in vitro
and in vivo.



Uses

The tissue specificity of Zcytor5 expression
suggests that Zcytor5 may be a receptor for a growth and/or
maintenance factor in the thyroid, heart and skeletal
muscle. Zcytor5 could therefore be used to down-regulate
the effects of the factor by administering soluble Zcytor5
to the patient. For example, the soluble receptor could be
used to lessen the effect of cardiotrophin-1 on cardiac
pathologies, thus preventing enlargement of the heart due
to heart disease. Zcytor5 could also be used as a
diagnostic to test for the presence of cardiotrophin-1 in
the blood. Furthermore, Zcytor5 can be used to discover
other possible ligands which would bind to Zcytor5.

The present invention also provides reagents
which will find use in diagnostic applications. For
example, a probe comprising the Zcytor5 gene, Zcytor5
DNA or RNA, or a subsequence thereof can be used to
determine if the Zcytor5 gene is present on chromosome 1
or if a mutation has occurred.

Antibodies to Zcytor5 could be used to purify
Zcytor5 and as a therapeutic to modulate the effect of the
Zcytor5 ligand. The anti-idiotypic antibody to Zcytor5
could be used to purify the ligand of Zcytor5, and the
administration of the anti-idiotypic antibody could be
used to modulate the effect of the Zcytor5 ligand.

The invention is further illustrated by the following
non-limiting examples.

Example 1

Cloning of Human Zcytor5

Human Zcytor5 was identified from expressed
sequence tag (EST) 698365 (SEQ ID NO: 7) identified in an
EST database. The cDNA containing EST 698365 was obtained
from Incyte Pharmaceuticals, Inc. as dried DNA. Upon
reconstitution in water, the cDNA was transfected into E.
coli strain DH10B. The plasmid was designated pSL8365.
The EST in plasmid pSL8365 was sequenced, revealing an
insert of 952 bp.

The GENE TRAPPER® cDNA positive selection system
(Life Technologies, Gaithersburg, MD) employing
oligonucleotide ZC11,286 (SEQ ID NO: 8) was used to
isolate the plasmid Hzcytor5-9 from a human lung cDNA
library (obtained from Life Technologies Inc.,
Gaithersburg, MD) in accordance with the manufacturer's
directions. Hzcytor5-9 extended the sequence of pSL8365 by
459 bp. The sequence present in Hzcytor5-9 allowed the
isolation of an overlapping EST No. 485212 (SEQ ID NO: 9),
which extended the open reading frame of Hzcytor5-9 by a
further 33 codons.

A cDNA encoding full-length Zcytor5 was isolated
from a human testis cDNA library. (See Example 2 for the
preparation of the human testis cDNA library.) The library
consisted of eighty pools of plasmid DNA, each pool
containing 10,000 independent recombinants. The presence
of Zcytor5 cDNA in each library pool was determined by PCR
employing primers ZC11,663 (SEQ ID NO: 10) and ZC12,212
(SEQ ID NO: 11). PCR was carried out using AMPLITAQ® DNA
polymerase (Perkin-Elmer) in buffer conditions recommended
by the supplier. The amplification was carried out at 94°C
for 1 minute followed by 30 cycles, each cycle consisting
of 20 seconds at 94°C, 1 minute at 66°C and 7 minutes at
74°C. Five cDNA pools were found to be positive for the
420 bp PCR product by agarose gel electrophoresis.
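
The pool-screening arithmetic above can be made explicit with a
short Python sketch. It computes the total library size from the
stated pool counts and then estimates how many colonies from one
positive pool would need to be plated to recover the clone with a
chosen probability. The assumption that a positive pool contains
exactly one Zcytor5 clone among its 10,000 recombinants, and the
95% target, are illustrative and are not taken from the text.

```python
import math

POOLS = 80                       # from the text
RECOMBINANTS_PER_POOL = 10_000   # from the text
POSITIVE_POOLS = 5               # from the text


def library_size(pools, per_pool):
    """Total number of independent recombinants in the pooled library."""
    return pools * per_pool


def colonies_to_screen(clone_fraction, probability):
    """Colonies needed to see a clone at least once: N = ln(1-P) / ln(1-f)."""
    if not (0.0 < clone_fraction < 1.0 and 0.0 < probability < 1.0):
        raise ValueError("fraction and probability must lie strictly between 0 and 1")
    return math.ceil(math.log(1.0 - probability) / math.log(1.0 - clone_fraction))


total = library_size(POOLS, RECOMBINANTS_PER_POOL)
# Assumption: one positive clone per positive pool
needed = colonies_to_screen(1.0 / RECOMBINANTS_PER_POOL, 0.95)
print(f"library size: {total} recombinants in {POOLS} pools")
print(f"positive pools: {POSITIVE_POOLS} of {POOLS}")
print(f"colonies to plate from one positive pool (95%): {needed}")
```

Plating roughly three times the pool complexity is a common rule
of thumb that follows from the same formula.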

Plasmid DNA from one positive library pool was
electroporated into DH10B cells and plated. Colony lifts
were prepared using Hybond-N filters (Amersham; Arlington
Heights, IL) according to the procedure provided by the
manufacturer. Following denaturation and neutralization,
DNA was cross-linked onto the filters with 1,200 µJoules
of UV energy in a STRATALINKER® (Stratagene Cloning
Systems). Cell debris was removed by several washes in
0.25X standard sodium citrate (SSC), 0.25% sodium dodecyl
sulfate (SDS) and 1 mM EDTA at 65°C. The filters were then
pre-hybridized overnight at 65°C in EXPRESSHYB® solution
(Clontech) with 1 mg/ml heat-denatured salmon sperm DNA.
Colonies positive for Zcytor5 were identified by
hybridization with a probe that was generated from EST
484212 (SEQ ID NO: 9) cDNA employing PCR primers ZC11,663
(SEQ ID NO: 10) and ZC12,212 (SEQ ID NO: 11). The PCR
product probe was purified by agarose gel electrophoresis.
100 ng of the probe was labeled with 32P-dCTP using the
MULTIPRIME® DNA labeling system (Amersham).
Unincorporated label was removed with a NUCTRAP® column
(Stratagene). Probe hybridization was carried out
overnight at 65°C in EXPRESSHYB® solution at a probe
concentration of 1 x 10^6 cpm/ml. The filters were washed at
65°C in a wash buffer containing 0.25X SSC, 0.25% SDS and 1
mM EDTA.

Three positive signals were identified and were
subjected to colony purification via a second round of
filter hybridization. Sequence analysis showed one positive
clone, SEQ ID NO: 3, to be full-length human
Zcytor5. Sequencing of several overlapping clones
revealed a second full-length sequence, SEQ ID NO: 1, which
is an allelic variant of SEQ ID NO: 3.

Example 2

Construction of the Human Testis cDNA Library

Fourteen µl of poly d(T)-selected poly(A)+
human testis mRNA (Clontech) at a concentration of 1.0
µg/µl was mixed with 2 µl of 20 pmole/µl first strand
primer ZC2938 (SEQ ID NO: 12) containing an Sst I
restriction site. The mixture was heated at 65°C for 4
minutes and cooled by chilling on ice. First strand cDNA
synthesis was initiated by the addition of 8 µl of 250 mM
Tris-HCl, pH 8.3, 375 mM KCl, 15 mM MgCl2 (5x SUPERSCRIPT™
buffer; GIBCO BRL), 4 µl of 100 mM dithiothreitol (DTT)
and 2 µl of a deoxynucleotide triphosphate solution
containing 10 mM each of dATP, dGTP, dTTP and
5-methyl-dCTP (Pharmacia LKB Biotechnology Inc.) to the RNA-primer
mixture. The reaction mixture was incubated at 45°C for 4
minutes followed by the addition of 10 µl of 200 U/µl
RNase H- reverse transcriptase (GIBCO BRL). The efficiency
of the first strand synthesis was analyzed in a parallel
reaction by the addition of 10 µCi of 32P-αdCTP to a 10 µl
aliquot of the reaction mixture to label the reaction for
analysis. The reactions were incubated at 45°C for 1 hour
followed by an incubation at 50°C for 15 minutes.
Unincorporated 32P-αdCTP in the labeled reaction was
removed by chromatography on a 400 pore size gel
filtration column (CHROMA SPIN + TE-400™; Clontech
Laboratories Inc.). Unincorporated nucleotides in the
unlabeled first strand reaction were removed by twice
precipitating the cDNA in the presence of 10 µg of
glycogen carrier, 2.5 M ammonium acetate and 2.5 volumes of
ethanol. The unlabeled cDNA was resuspended in 50 µl water
for use in second strand synthesis. The length of the
labeled first strand cDNA was determined by agarose gel
electrophoresis.
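
The first strand reaction described above can be checked with a
small Python sketch that sums the stated component volumes and
derives final concentrations. The volumes and stock concentrations
are those given in the text; the assumption that all six
components are combined in a single mixture before the 10 µl
aliquot is removed for labeling, and the helper names, are ours.

```python
# name: (volume in ul, stock concentration, unit of the stock)
COMPONENTS = {
    "mRNA": (14.0, 1.0, "ug/ul"),
    "primer ZC2938": (2.0, 20.0, "pmol/ul"),
    "5x buffer": (8.0, 5.0, "x"),
    "DTT": (4.0, 100.0, "mM"),
    "dNTP mix (each)": (2.0, 10.0, "mM"),
    "reverse transcriptase": (10.0, 200.0, "U/ul"),
}


def total_volume(components):
    """Sum of all component volumes, in ul."""
    return sum(volume for volume, _, _ in components.values())


def final_concentration(components, name):
    """Stock concentration diluted into the full reaction volume."""
    volume, stock, _ = components[name]
    return stock * volume / total_volume(components)


print(f"total volume: {total_volume(COMPONENTS)} ul")
for name, (_, _, unit) in COMPONENTS.items():
    print(f"{name}: {final_concentration(COMPONENTS, name)} {unit}")
```

Under these assumptions the 5x buffer comes out at exactly 1x in a
40 µl reaction, which is a useful consistency check on the volumes
recovered from the text.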

Second strand synthesis was performed on first
strand cDNA under conditions that promoted first strand
priming of second strand synthesis resulting in DNA
hairpin formation. The reaction mixture was assembled at
room temperature and was comprised of 66 µl of the
unlabeled first strand cDNA, 20 µl of 5X polymerase I
buffer (100 mM Tris:HCl, pH 7.4, 500 mM KCl, 25 mM MgCl2,
50 mM (NH4)2SO4), 1 µl of 100 mM DTT, 1 µl of a solution
containing 20 mM of each deoxynucleotide triphosphate, 3 µl
of 5 mM β-NAD, 1 µl of 4 U/µl of E. coli DNA ligase (New
England Biolabs Inc., Beverly, MA) and 5 µl of 10 U/µl E.
coli DNA polymerase I (New England Biolabs, Inc.). The
reaction was incubated at room temperature for 5 minutes
followed by the addition of 2 µl of 2.2 U/µl RNase H
(GIBCO BRL). A parallel reaction in which a 10 µl aliquot
of the second strand synthesis mixture was labeled by the
addition of 10 µCi 32P-αdCTP was used to monitor the
efficiency of second strand synthesis. The reactions were
incubated at 15°C for two hours followed by a 15 minute
incubation at room temperature. Unincorporated 32P-αdCTP in
the labeled reaction was removed by chromatography through
a 400 pore size gel filtration column (Clontech
Laboratories, Inc.) before analysis by agarose gel
electrophoresis. The unlabeled reaction was terminated by
two extractions with phenol/chloroform and a chloroform
extraction followed by ethanol precipitation in the
presence of 2.5 M ammonium acetate.

The single-stranded DNA of the hairpin structure
was cleaved using mung bean nuclease. The reaction mixture
contained 100 µl of second strand cDNA, 20 µl of 10x mung
bean nuclease buffer (Stratagene Cloning Systems, La
Jolla, CA), 16 µl of 100 mM DTT, 51.5 µl of water and 12.5
µl of a 1:10 dilution of mung bean nuclease (Promega
Corp.; final concentration 10.5 U/µl) in mung bean
nuclease dilution buffer. The reaction was incubated at
37°C for 15 minutes. The reaction was terminated by the
addition of 20 µl of 1 M Tris:HCl, pH 8.0 followed by
sequential phenol/chloroform and chloroform extractions as
described above. Following the extractions, the DNA was
precipitated in ethanol and resuspended in water.

The resuspended cDNA was blunt-ended with T4 DNA
polymerase. The cDNA, which was resuspended in 138 µl of
water, was mixed with 40 µl of 5X T4 DNA polymerase buffer
(250 mM Tris:HCl, pH 8.0, 250 mM KCl, 25 mM MgCl2), 3 µl
0.1 M DTT, 5 µl of a solution containing 10 mM of each
deoxynucleotide triphosphate and 4 µl of 1 U/µl T4 DNA
polymerase (Boehringer Mannheim Corp., Indianapolis, IN).
After incubation of 1 hour at 10°C, the reaction was
terminated by the addition of 10 µl of 0.5 M EDTA
followed by serial phenol/chloroform and chloroform
extractions as described above. The DNA was
chromatographed through a 400 pore size gel filtration
column (Clontech Laboratories Inc., Palo Alto, CA) to
remove trace levels of protein and to remove short cDNAs
less than about 400 bp in length. The DNA was ethanol
precipitated in the presence of 12 µg glycogen carrier and
2.5 M ammonium acetate and was resuspended in 10 µl of
water. Based on the incorporation of 32P-αdCTP, the yield
of cDNA was estimated to be about 2 µg from a starting
template of 12.5 µg.

Eco RI adapters were ligated onto the 5' ends of
the cDNA to enable cloning into a lambda phage vector. A
10 µl aliquot of cDNA (containing about 2 µg of cDNA) and
11 µl of 65 pmole/µl of Eco RI adapter (Pharmacia LKB
Biotechnology Inc.) were mixed with 3 µl 10x ligase buffer
(Promega Corp.), 3 µl 10 mM ATP and 3 µl of 15 U/µl T4 DNA
ligase (Promega Corp.). The reaction was incubated
overnight (about 18 hours) at 12.5°C. The reaction was
terminated by the addition of 150 µl of water and 10 µl of
3 M Na acetate, followed by incubation at 65°C for 30
minutes. After incubation, the cDNA was extracted with
phenol/chloroform and chloroform as described above and
precipitated in the presence of 2.5 M ammonium acetate and
1.2 volumes of isopropanol. Following centrifugation, the
cDNA pellet was washed with 70% ethanol, air dried and
resuspended in 89 µl water.

To facilitate the directional cloning of the
cDNA into a lambda phage vector, the cDNA was digested
with Sst-I resulting in a cDNA having 5' Eco RI and 3'
Sst-I cohesive ends. The Sst-I restriction site at the 3'
end of the cDNA had been previously introduced through
primer ZC2938 (SEQ ID NO: 12). Restriction enzyme
digestion was carried out in a reaction containing 89 µl
of cDNA described above, 10 µl of 6 mM Tris:HCl, 6 mM
MgCl2, 150 mM NaCl, 1 mM DTT (10x D buffer; Promega Corp.,
Madison, WI) and 1 µl of 12 U/µl Not I (Promega Corp.).
Digestion was carried out at 37°C for 1 hour. The reaction
was terminated by serial phenol/chloroform and chloroform
extractions. The cDNA was ethanol precipitated, washed
with 70% ethanol, air dried and resuspended in 20 µl of 1x
gel loading buffer (10 mM Tris:HCl, pH 8.0, 1 mM EDTA, 5%
glycerol and 0.125% bromphenol blue).

The resuspended cDNA was heated to 65°C for 5
minutes, cooled on ice and electrophoresed on a 0.8% low
melt agarose gel (SEA PLAQUE GTG™ low melt agarose; FMC
Corp.). Unincorporated adapters and cDNA below 1.6 kb in
length were excised from the gel. The electrodes were
reversed, and the cDNA was electrophoresed until
concentrated near the lane of origin. The area of the gel
containing the concentrated cDNA was excised and placed in
a microfuge tube, and the approximate volume of the gel
slice was determined. A 300 µl aliquot of water,
approximately three times the volume of the gel slice, was
added to the tube. The agarose was then melted by heating
to 65°C for 15 minutes. Following equilibration of the
sample to 42°C, 10 µl of 1 U/µl β-agarase I (New England
Biolabs, Inc.) was added, and the mixture was incubated
for 90 minutes to digest the agarose. After incubation, 40
µl of 3 M Na acetate was added to the sample, and the
mixture was incubated on ice for 15 minutes. The sample
was centrifuged at 14,000 x g for 15 minutes at room
temperature to remove undigested agarose. The cDNA in the
supernatant was ethanol precipitated, washed in 70%
ethanol, air-dried and resuspended in 37 µl of water for
the kinase reaction to phosphorylate the ligated Eco RI
adapters.

To the 37 µl cDNA solution described above was
added 10 µl of 10x ligase buffer (Stratagene Cloning
Systems), and the mixture was heated to 65° C for 5
minutes. The mixture was cooled on ice, and 5 µl of 10 mM
ATP and 3 µl of 10 U/µl of T4 polynucleotide kinase
(Stratagene Cloning Systems) were added. The reaction was
incubated at 37° C for 45 minutes and was terminated by
heating to 65° C for 10 minutes followed by serial
extractions with phenol/chloroform and chloroform. The
phosphorylated cDNA was ethanol precipitated in the
presence of 2.5 M ammonium acetate, washed with 70%
ethanol, air dried and resuspended in 12.5 µl water. The
concentration of the phosphorylated cDNA was estimated to
be about 40 fmole/µl.

Example 3

Northern Blot Analysis of Human Zcytor5

A 300 bp double stranded DNA probe for Northern
analysis was prepared from pSL1034 by PCR using
oligonucleotide primers ZC 10,787 (SEQ ID NO: 13) and ZC
11,097 (SEQ ID NO: 14). The 300 bp PCR fragment was
gel-purified using a QIAQUICK® purification kit (Qiagen
Inc., Chatsworth, CA) and random-primer labeled using a
MULTIPRIME® kit (Amersham Corp.). Labeled cDNA was
purified from free counts using a Stratagene push column.
Human multiple tissue Northern blots (Clontech
Laboratories) and a human fetal tissue Northern blot
(Clontech Laboratories) were pre-hybridized for three
hours at 68°C using EXPRESSHYB hybridization solution
(Clontech Laboratories). The 32P-labeled cDNA probe was
then added to 10 ml of fresh hybridization solution at
10^6 cpm/ml, and the blots were hybridized overnight at
68°C. The blots were washed several times at room
temperature in wash solution containing 2X SSC, 0.05% SDS,
then washed with continuous agitation for 40 min at room
temperature. The blots were then washed in 0.1X SSC, 0.1%
SDS at 50°C for 40 min with one change of wash solution.

A single transcript of ~2.3 kb was detected
after exposure to film. In the multiple tissue blots (MTN,
MTN II and MTN III; Clontech Laboratories) the transcript
was present in highest abundance in placenta, thyroid,
heart and skeletal muscle with lower levels in prostate
and trachea. Trace mRNA levels were found in kidney,
pancreas, testis, small intestine, colon, lymph node,
adrenal cortex and bone marrow.

Example 4

Chromosomal Assignment and Placement of Human Zcytor5

Zcytor5 was mapped to chromosome 19 using the
commercially available version of the "Stanford G3
Radiation Hybrid Mapping Panel" (Research Genetics, Inc.,
Huntsville, AL). The "Stanford G3 RH Panel" contains
PCRable DNAs from each of 83 radiation hybrid clones of
the whole human genome, plus two control DNAs (the RM
donor and the A3 recipient). A publicly available WWW
server (http://shgc-www.stanford.edu) allows chromosomal
localization of markers.

For the mapping of Zcytor5 with the "Stanford G3 RH
Panel", 20 µl reactions were set up in a PCRable 96-well
microtiter plate (Stratagene, La Jolla, CA) and used in a
"RoboCycler Gradient 96" thermal cycler (Stratagene). Each
of the 85 PCR reactions consisted of 2 µl 10X KlenTaq PCR
reaction buffer (CLONTECH Laboratories, Inc., Palo Alto,
CA), 1.6 µl dNTPs mix (2.5 mM each, PERKIN-ELMER, Foster
City, CA), 1 µl sense primer (SEQ ID NO: 13), 5' TAT GGC
CAG GAC AAC ACA 3', 1 µl antisense primer (SEQ ID NO: 14),
5' ATA GGG CGT AAA GAG AGC 3', 2 µl "RediLoad" (Research
Genetics, Inc., Huntsville, AL), 0.4 µl 50X Advantage
KlenTaq Polymerase Mix (Clontech Laboratories, Inc.), 25
ng of DNA from an individual hybrid clone or control and x
µl ddH2O for a total volume of 20 µl. The reactions were
overlaid with an equal amount of mineral oil and sealed.
The PCR cycler conditions were as follows: an initial 1
cycle 5 minute denaturation at 95°C, 35 cycles of a 1
minute denaturation at 95°C, 1 minute annealing at 66°C and
1.5 minute extension at 72°C, followed by a final 1 cycle
extension of 7 minutes at 72°C. The reactions were
separated by electrophoresis on a 2% agarose gel (Life
Technologies, Gaithersburg, MD).

The results showed linkage of Zcytor5 to the
framework marker WI-7289 with a LOD score of >10 and at a
distance of 14.67 cR_10000 from the marker. The use of
surrounding markers positions Zcytor5 in the 19p13.1-p11
region on the integrated LDB chromosome 19 map (The
Genetic Location Database, University of Southampton, WWW
server: http://cedar.genetics.soton.ac.uk/public_html/).
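
As an illustrative cross-check only (not part of the disclosed
procedure), the two mapping primers of Example 4 can be located on
the human cDNA. The sketch below uses a fragment of SEQ ID NO:1
(nucleotides 538 to 681, copied from the sequence listing) and the
primer sequences given above; the function names and the 0-based
indexing are assumptions made for the example.

```python
# Fragment of SEQ ID NO:1, nucleotides 538-681, as printed in the listing.
TEMPLATE = (
    "ACC TTC CTC CAC ACC AAC TAC TCC CTC AAG TAC AAG CTT AGG TGG TAT "
    "GGC CAG GAC AAC ACA TGT GAG GAG TAC CAC ACA GTG GGG CCC CAC TCC "
    "TGC CAC ATC CCC AAG GAC CTG GCT CTC TTT ACG CCC TAT GAG ATC TGG"
).replace(" ", "")

SENSE = "TATGGCCAGGACAACACA"      # sense primer, SEQ ID NO:13
ANTISENSE = "ATAGGGCGTAAAGAGAGC"  # antisense primer, SEQ ID NO:14

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def primer_sites(template: str, sense: str, antisense: str) -> tuple[int, int]:
    """Return 0-based start positions of the sense primer and of the
    antisense primer's binding site on the sense strand (-1 if absent)."""
    return template.find(sense), template.find(reverse_complement(antisense))


fwd_start, rev_start = primer_sites(TEMPLATE, SENSE, ANTISENSE)
print(fwd_start, rev_start)
```

Both primers are found on this fragment, which is consistent with
their use as a Zcytor5-specific pair; the positions are relative to
the fragment, not to the full-length cDNA.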

Example 5

Cloning of the Rat Zcytor5 Gene

Rat cDNA encoding Zcytor5 was isolated from an
amplified rat testis cDNA library with a probe that was
generated using primers ZC12212 (SEQ ID NO: 11) and
ZC10785 (SEQ ID NO: 15) and 10 ng of plasmid pSL85212 as
a template obtained from cDNA containing EST 698365 as
described in Example 1. The probe was prepared by PCR by
combining 1 µl containing 10 ng of pSL85212, 1 µl of
ZC12212 having a concentration of 20 pmole/µl, 1 µl of
ZC10785 having a concentration of 20 pmole/µl, 0.5 µl of
dNTP having a concentration of 20 mM of dATP, dGTP, dCTP
and dTTP, 5 µl of 10X Klentaq polymerase buffer
(Clontech), 5 µl Klentaq DNA polymerase (Clontech) and
39.5 µl water. The amplification was carried out at 94° C
for 1 minute followed by 30 cycles, each cycle consisting
of 15 seconds at 95° C, 20 seconds at 62° C and 1 minute
at 68° C. The reaction had a final incubation at 68° C
for 10 minutes.
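
The cycling program just described can be written down as data, which
makes the total programmed hold time easy to compute. This is a
minimal sketch under stated assumptions: the `Step` class and
`hold_time_seconds` function are hypothetical names introduced here,
and ramp times between temperatures are ignored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: int   # block temperature in degrees C
    seconds: int  # programmed hold time


# Values taken from the probe amplification described in Example 5.
INITIAL = Step(94, 60)
CYCLE = [Step(95, 15), Step(62, 20), Step(68, 60)]
N_CYCLES = 30
FINAL = Step(68, 600)


def hold_time_seconds(initial: Step, cycle: list[Step], n_cycles: int, final: Step) -> int:
    """Sum of programmed holds; excludes instrument ramp time."""
    per_cycle = sum(step.seconds for step in cycle)
    return initial.seconds + n_cycles * per_cycle + final.seconds


total = hold_time_seconds(INITIAL, CYCLE, N_CYCLES, FINAL)
print(f"{total} s = {total / 60:.1f} min")
```

Actual run time on a thermal cycler would be longer, since heating
and cooling between steps are not counted.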

The resulting PCR product was diluted 1:100 with
water. Four µl of the diluted PCR product was re-amplified
using the above-described conditions and the resultant PCR
product was further purified by electrophoresis on
low-melt agarose gel. The DNA probe was recovered from the
low-melt gel by digestion with β-agarase I. The rat
Zcytor5 gene was then cloned from a rat testis library
which was constructed as described below in Example 6.

In cloning the rat Zcytor5 gene, the library was
first amplified by plating 3 x 10^5 plaque forming units
(pfu) from the previously constructed primary library onto
98 NZY plates (150 mm). Ten ml of serum medium was added
to each plate, and the plates were incubated for several
hours at room temperature. Following incubation, the phage
lysates were collected and pooled to yield the amplified
phage library.

1.5 million pfu from the amplified rat testis
cDNA library were plated onto 150 mm NZY plates at a
density of 40,000 pfu/plate on XL-1 Blue MRF' host cells.
Following incubation at 37° C overnight, filter lifts were
made using HYBOND-N™ membranes (Amersham), according to
the procedures provided by the manufacturer. The filters
were processed by denaturation in solution containing 1.5
M NaCl and 0.5 M NaOH for 8 minutes at room temperature.
The filters were neutralized in 0.5 M Tris:HCl, pH 7.2
for 5 minutes. Phage DNA was fixed onto the filters with
1,200 µJoules of UV energy in a UV Cross-linker
(Stratagene). The filters were then washed with 0.25X SSC
at 70° C to remove excess cellular debris. Filter
pre-hybridization was carried out in a hybridization
solution containing 5X SSC, 5X Denhardt solution, 0.2%
SDS, 1 mM EDTA and heat denatured sheared salmon-sperm DNA
at a final concentration of 100 µg/ml for 72 hours at
60° C.
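
The plating density above implies a plate count that the text does
not state. A short sketch of the arithmetic follows; the function
name is hypothetical, and rounding up to whole plates is an
assumption.

```python
def plates_needed(total_pfu: int, pfu_per_plate: int) -> int:
    """Smallest whole number of plates that holds total_pfu at the given density."""
    if total_pfu < 0 or pfu_per_plate <= 0:
        raise ValueError("total_pfu must be >= 0 and pfu_per_plate must be > 0")
    # Integer ceiling division avoids floating-point rounding.
    return -(-total_pfu // pfu_per_plate)


# 1.5 million pfu screened at 40,000 pfu per 150 mm plate (values from the text).
n_plates = plates_needed(1_500_000, 40_000)
print(n_plates)
```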

75 ng of probe DNA was labeled with 32P-dCTP
using a MEGAPRIME® labeling kit (Amersham) and was
purified with a NUCTRAP® column (Stratagene). The labeled
probe was heat-denatured and added to fresh hybridization
solution at a concentration of 1.5 x 10^6 cpm/ml. Into this
solution were also added the filters containing the phage
particles. Hybridization of the probes to the
phage-containing filters was completed overnight at 45° C.
Following hybridization, the filters were washed in a
solution containing 0.25X SSC, 0.25% SDS and 1 mM EDTA at
50° C. The washed filters were autoradiographed for 72
hours at -70° C with intensifying screens. Examination of
the autoradiographs revealed multiple regions that
hybridized with the labeled probe. Agar plugs were picked
from 56 regions for plaque purification. Of the positive
signals, eleven produced positive phagemids following
secondary and tertiary hybridization screens. The plasmids
within the positive phagemids were recovered using the
EXASSIST/SOLR™ system according to the manufacturer's
specifications. A clone designated pSLRatR5-1 was
sequenced and found to encode full length rat Zcytor5 (SEQ
ID NO: 5).

Example 6 

Production of Rat Testis cDNA library 

The rat first strand cDNA reaction contained 10
µl of rat testis poly d(T)-selected poly(A)+ mRNA
(Clontech, Palo Alto, CA) at a concentration of 1.0 µg/µl,
and 2 µl of 20 pmole/µl first strand primer ZC6091 (SEQ ID
NO: 16) containing an Xho I restriction site. The mixture
was heated at 70°C for 4 minutes and cooled by chilling on
ice. First strand cDNA synthesis was initiated by the
addition of 8 µl of first strand buffer (5x SUPERSCRIPT™
buffer; Life Technologies, Gaithersburg, MD), 4 µl of 100
mM dithiothreitol, and 2 µl of a deoxynucleotide
triphosphate solution containing 10 mM each of dATP, dGTP
and 5-methyl-dCTP (Pharmacia LKB Biotechnology,
Piscataway, NJ) to the RNA-primer mixture. The reaction
mixture was incubated at 45° C for 2 minutes, followed by
the addition of 10 µl of 200 U/µl RNase H- reverse
transcriptase (SUPERSCRIPT II®; Life Technologies). The
efficiency of the first strand synthesis was analyzed in a
parallel reaction by the addition of 10 µCi of 32P-αdCTP
to a 5 µl aliquot from one of the reaction mixtures to
label the reaction for analysis. The reactions were
incubated at 45°C for 1 hour followed by an incubation at
50° C for 10 minutes. Unincorporated 32P-αdCTP in the
labeled reaction was removed by chromatography on a 400
pore size gel filtration column (Clontech). The
unincorporated nucleotides and primers in the unlabeled
first strand reactions were removed by chromatography on a
400 pore size gel filtration column (Clontech). The length
of labeled first strand cDNA was determined by agarose gel
electrophoresis.

The second strand reaction contained 102 µl of
the unlabeled first strand cDNA, 30 µl of 5x polymerase I
buffer (125 mM Tris:HCl, pH 7.5, 500 mM KCl, 25 mM
MgCl2, 50 mM (NH4)2SO4), 2 µl of 100 mM dithiothreitol, 3
µl of a solution containing 10 mM of each deoxynucleotide
triphosphate, 5 µl of 5 mM β-NAD, 2 µl of 3 U/µl E. coli
DNA ligase (New England Biolabs), 5 µl of 10 U/µl E. coli
DNA polymerase I (New England Biolabs), and 1.5 µl of 2
U/µl RNase H (Life Technologies). A 10 µl aliquot from one
of the second strand synthesis reactions was labeled by
the addition of 10 µCi 32P-αdCTP to monitor the efficiency
of second strand synthesis. The reactions were incubated
at 16° C for two hours, followed by the addition of 10 µl
T4 DNA polymerase (10 U/µl, Boehringer Mannheim) and
incubated for an additional 5 minutes at 16°C.
Unincorporated 32P-αdCTP in the labeled reaction was
removed by chromatography through a 400 pore size gel
filtration column (Clontech) before analysis by agarose
gel electrophoresis. The unlabeled reaction was terminated
by the addition of 20 µl 0.5 M EDTA and extraction with
phenol/chloroform and chloroform followed by ethanol
precipitation in the presence of 2.5 M ammonium acetate.
The yield of cDNA was estimated to be approximately 2 µg
from starting mRNA template of 10 µg.

Eco RI adapters were ligated onto the 5' ends of
the cDNA described above to enable cloning into an
expression vector. A 10.5 µl aliquot of cDNA (~2 µg) and
5 µl of 65 pmole/µl of Eco RI adapter (Pharmacia LKB
Biotechnology Inc.) were mixed with 2.5 µl 10x ligase
buffer (66 mM Tris-HCl pH 7.5, 10 mM MgCl2), 2.5 µl of 10
mM ATP and 1 µl of 15 U/µl T4 DNA ligase (Promega Corp.,
Madison, WI). The reaction was incubated overnight (~12
hours) at 12°C. The reaction was terminated by incubation
at 70°C for 20 minutes. After incubation, the reaction
was cooled to 37°C. To the reaction was added 2.5 µl 10 mM
ATP and 3 µl 10 U/µl T4 polynucleotide kinase (Life
Technologies) to phosphorylate the ligated Eco RI
adapters.

To facilitate the directional cloning of the
cDNA into an expression vector, the cDNA was digested with
Xho I, resulting in a cDNA having a 5' Eco RI cohesive end
and a 3' Xho I cohesive end. The Xho I restriction site
at the 3' end of the cDNA had been previously introduced
using the ZC6091 primer (SEQ ID NO: 16). Restriction
enzyme digestion was carried out in a reaction mixture
containing 25 µl of cDNA described above, 15 µl of 10x H
Buffer (Boehringer Mannheim), 109 µl H2O, and 1.0 µl of 40
U/µl Xho I (Boehringer Mannheim). Digestion was carried
out at 37°C for 40 minutes. The reaction was terminated by
incubation at 65° C for 10 minutes and chromatography
through a 400 pore size gel filtration column (Clontech).
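
The component volumes listed for the Xho I digestion can be checked
for internal consistency. The sketch below sums the volumes and
derives the final buffer strength and total enzyme units; the
dictionary keys and variable names are illustrative assumptions, and
the figures are those given in the paragraph above.

```python
# Component volumes in microliters, as listed for the Xho I digestion.
components_ul = {
    "cDNA": 25.0,
    "10x H buffer": 15.0,
    "H2O": 109.0,
    "Xho I (40 U/ul)": 1.0,
}

total_ul = sum(components_ul.values())

# A 10x stock diluted into the total volume gives the working strength.
buffer_final_x = 10.0 * components_ul["10x H buffer"] / total_ul

# Units of enzyme added = stock concentration (U/ul) x volume (ul).
enzyme_units = 40.0 * components_ul["Xho I (40 U/ul)"]

print(total_ul, buffer_final_x, enzyme_units)
```

The volumes add up to 150 µl, in which 15 µl of 10x buffer gives a 1x
working concentration.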

The cDNA was ethanol precipitated, washed with
70% ethanol, air dried and resuspended in 20 µl of 1x gel
loading buffer (10 mM Tris:HCl, pH 8.0, 1 mM EDTA, 5%
glycerol and 0.125% bromphenol blue). The resuspended
cDNA was heated to 65° C for 5 minutes, cooled on ice and
electrophoresed on a 0.8% low melt agarose gel. The
contaminating adapters and cDNA below 0.6 kb in length
were excised from the gel. The electrodes were reversed,
and the cDNA was electrophoresed until concentrated near
the lane origin. The area of the gel containing the
concentrated cDNA was excised and placed in a microfuge
tube, and the approximate volume of the gel slice was
determined. An aliquot of water approximately three times
the volume of the gel slice (300 µl) and 35 µl 10x
β-agarase I buffer (New England Biolabs) were added to the
tube, and the agarose was melted by heating to 65°C for 15
minutes. Following equilibration of the sample to 45°C,
3 µl of 1 U/µl β-agarase I (New England Biolabs) was
added, and the mixture was incubated for 60 minutes at 45°C
to digest the agarose. After incubation, 40 µl of 3 M Na
acetate was added to the sample, and the mixture was
incubated on ice for 15 minutes. The sample was
centrifuged at 14,000 x g for 15 minutes at room
temperature to remove undigested agarose. The cDNA was
ethanol precipitated, washed in 70% ethanol, air-dried and
resuspended in 10 µl water.

The resulting cDNA was cloned into the lambda
phage vector λZapII that was predigested with Eco RI and
Xho I and dephosphorylated (Stratagene Cloning Systems, La
Jolla, CA). Ligation of the cDNA to the λZapII vector was
carried out in a reaction mixture containing 1.0 µl of
prepared vector, 1.0 µl of rat testis cDNA, 1.0 µl 10X
Ligase Buffer (Promega), 1.0 µl of 10 mM ATP, 5 µl water,
and 1.0 µl of T4 DNA Ligase at 15 units/ml (Promega). The
ligation mixture was incubated at 5°C-15°C overnight in a
temperature gradient. After incubation, the ligation
mixture was packaged into phage using GIGAPACK III GOLD
packaging extract (Stratagene Cloning Systems) and the
resulting library was titered according to the
manufacturer's specifications.

SEQUENCE LISTING

(1) GENERAL INFORMATION 

(i) APPLICANT: ZymoGenetics, Inc.

1201 Eastlake Avenue East 

Seattle 

WA 

USA 

98102 

(ii) TITLE OF THE INVENTION: MAMMALIAN ZCYTOR5



(iii) NUMBER OF SEQUENCES: 37 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Zymogenetics 

(B) STREET: 1201 Eastlake Ave East 

(C) CITY: Seattle 

(D) STATE: WA 

(E) COUNTRY: USA 

(F) ZIP: 98102 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Lunn, Paul G

(B) REGISTRATION NUMBER: 32,743
(C) REFERENCE/DOCKET NUMBER: 96-22PC

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 206-442-6627 

(B) TELEFAX: 206-442-6678 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 52...1317
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

CGCCACGCGC CGAGCCGCAG CCCGCCGCGC GCCCCCGGCA GCGCCGGCCC C ATG CCC  57
                                                         Met Pro
                                                         1

GCC GGC CGC CGG GGC CCC GCC GCC CAA TCC GCG CGG CGG CCG CCG CCG  105
Ala Gly Arg Arg Gly Pro Ala Ala Gln Ser Ala Arg Arg Pro Pro Pro
        5                   10                  15

TTG CTG CCC CTG CTG CTG CTG CTC TGC GTC CTC GGG GCG CCG CGA GCC  153
Leu Leu Pro Leu Leu Leu Leu Leu Cys Val Leu Gly Ala Pro Arg Ala
    20                  25                  30

GGA TCA GGA GCC CAC ACA GCT GTG ATC AGT CCC CAG GAT CCC ACG CTT  201
Gly Ser Gly Ala His Thr Ala Val Ile Ser Pro Gln Asp Pro Thr Leu
35                  40                  45                  50

CTC ATC GGC TCC TCC CTG CTG GCC ACC TGC TCA GTG CAC GGA GAC CCA  249
Leu Ile Gly Ser Ser Leu Leu Ala Thr Cys Ser Val His Gly Asp Pro
                55                  60                  65

CCA GGA GCC ACC GCC GAG GGC CTC TAC TGG ACC CTC AAT GGG CGC CGC  297
Pro Gly Ala Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn Gly Arg Arg
            70                  75                  80

CTG CCC CCT GAG CTC TCC CGT GTA CTC AAC GCC TCC ACC TTG GCT CTG  345
Leu Pro Pro Glu Leu Ser Arg Val Leu Asn Ala Ser Thr Leu Ala Leu
        85                  90                  95

GCC CTG GCC AAC CTC AAT GGG TCC AGG CAG CGG TCG GGG GAC AAC CTC  393
Ala Leu Ala Asn Leu Asn Gly Ser Arg Gln Arg Ser Gly Asp Asn Leu
    100                 105                 110

GTG TGC CAC GCC CGT GAC GGC AGC ATC CTG GCT GGC TCC TGC CTC TAT  441
Val Cys His Ala Arg Asp Gly Ser Ile Leu Ala Gly Ser Cys Leu Tyr
115                 120                 125                 130

GTT GGC CTG CCC CCA GAG AAA CCC GTC AAC ATC AGC TGC TGG TCC AAG  489
Val Gly Leu Pro Pro Glu Lys Pro Val Asn Ile Ser Cys Trp Ser Lys
                135                 140                 145

AAC ATG AAG GAC TTG ACC TGC CGC TGG ACG CCA GGG GCC CAC GGG GAG  537
Asn Met Lys Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala His Gly Glu
            150                 155                 160

ACC TTC CTC CAC ACC AAC TAC TCC CTC AAG TAC AAG CTT AGG TGG TAT  585
Thr Phe Leu His Thr Asn Tyr Ser Leu Lys Tyr Lys Leu Arg Trp Tyr
        165                 170                 175

GGC CAG GAC AAC ACA TGT GAG GAG TAC CAC ACA GTG GGG CCC CAC TCC  633
Gly Gln Asp Asn Thr Cys Glu Glu Tyr His Thr Val Gly Pro His Ser
    180                 185                 190

TGC CAC ATC CCC AAG GAC CTG GCT CTC TTT ACG CCC TAT GAG ATC TGG  681
Cys His Ile Pro Lys Asp Leu Ala Leu Phe Thr Pro Tyr Glu Ile Trp
195                 200                 205                 210

GTG GAG GCC ACC AAC CGC CTG GGC TCT GCC CGC TCC GAT GTA CTC ACG  729
Val Glu Ala Thr Asn Arg Leu Gly Ser Ala Arg Ser Asp Val Leu Thr
                215                 220                 225

CTG GAT ATC CTG GAT GTG GTG ACC ACG GAC CCC CCG CCC GAA GTG CAC  777
Leu Asp Ile Leu Asp Val Val Thr Thr Asp Pro Pro Pro Glu Val His
            230                 235                 240

GTG AGC CGC GTC GGG GGC CTG GAG GAC CAG CTG AGC GTG CGC TGG GTG  825
Val Ser Arg Val Gly Gly Leu Glu Asp Gln Leu Ser Val Arg Trp Val
        245                 250                 255

TCG CCA CCC GCC CTC AAG GAT TTC CTC TTT CAA GCC AAA TAC CAG ATC  873
Ser Pro Pro Ala Leu Lys Asp Phe Leu Phe Gln Ala Lys Tyr Gln Ile
    260                 265                 270

CGC TAC CGA GTG GAG GAC AGT GTG GAC TGG AAG GTG GTG GAC GAT GTG  921
Arg Tyr Arg Val Glu Asp Ser Val Asp Trp Lys Val Val Asp Asp Val
275                 280                 285                 290

AGC AAC CAG ACC TCC TGC CGC CTG GCC GGC CTG AAA CCC GGC ACC GTG  969
Ser Asn Gln Thr Ser Cys Arg Leu Ala Gly Leu Lys Pro Gly Thr Val
                295                 300                 305

TAC TTC GTG CAA GTG CGC TGC AAC CCC TTT GGC ATC TAT GGC TCC AAG  1017
Tyr Phe Val Gln Val Arg Cys Asn Pro Phe Gly Ile Tyr Gly Ser Lys
            310                 315                 320

AAA GCC GGG ATC TGG AGT GAG TGG AGC CAC CCC ACA GCC GCC TCC ACT  1065
Lys Ala Gly Ile Trp Ser Glu Trp Ser His Pro Thr Ala Ala Ser Thr
        325                 330                 335

CCC CGC AGT GAG CGC CCG GGC CCG GGC GGC GGG GCG TGC GAA CCG CGG  1113
Pro Arg Ser Glu Arg Pro Gly Pro Gly Gly Gly Ala Cys Glu Pro Arg
    340                 345                 350

GGC GGA GAG CCG AGC TCG GGG CCG GTG CGG CGC GAG CTC AAG CAG TTC  1161
Gly Gly Glu Pro Ser Ser Gly Pro Val Arg Arg Glu Leu Lys Gln Phe
355                 360                 365                 370

CTG GGC TGG CTC AAG AAG CAC GCG TAC TGC TCC AAC CTC AGC TTC CGC  1209
Leu Gly Trp Leu Lys Lys His Ala Tyr Cys Ser Asn Leu Ser Phe Arg
                375                 380                 385

CTC TAC GAC CAG TGG CGA GCC TGG ATG CAG AAG TCG CAC AAG ACC CGC  1257
Leu Tyr Asp Gln Trp Arg Ala Trp Met Gln Lys Ser His Lys Thr Arg
            390                 395                 400

AAC CAG GAC GAG GGG ATC CTG CCC TCG GGC AGA CGG GGC ACG GCG AGA  1305
Asn Gln Asp Glu Gly Ile Leu Pro Ser Gly Arg Arg Gly Thr Ala Arg
        405                 410                 415

GGT CCT GCC AGA TAAGCTGTAG GGGCTCAGGC CACCCTCCCT GCCACGTGGA GACGC  1362
Gly Pro Ala Arg
    420

AGAGGCCGAA CCCAAACTGG GGCCACCTCT GTACCCTCAC TTCAGGGCAC CTGAGCCACC 1422 

CTCAGCAAGA GCTGGGGTGG CCCCTGAGCT CCAACGGCCA TAACAGCTCT GACTCCCACG 1482 

TGAGGCCACC TTTGGGTGCA CCCCAGTGGG TGTGTGTGTG TGTGTGAGGG TTGGTTGAGT 1542 

TGCCTAGAAC CCCTGCCAGG GCTGGGGGTG AGAAGGGGAG TCATTACTCC CCATTACCTA 1602 

GGGCCCCTCC AAAAGAGTCC TTTTAAATAA ATGAGCTATT TAGGTGCAAA AAAAAAAAAA 1662 

AAAAAAAAAT TGCCCTCGTG CCGAATTC 1690 
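
As a consistency check on the listing (an illustrative sketch, not
part of the filing), the coding-sequence LOCATION of a cDNA entry
should span a whole number of codons equal to the stated length of
the corresponding protein entry. The function name below is
hypothetical; the coordinates and lengths are those stated for SEQ ID
NO:1/NO:2 and SEQ ID NO:3/NO:4.

```python
def cds_codons(start: int, end: int) -> int:
    """Number of codons in a CDS given 1-based inclusive coordinates."""
    length = end - start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError(f"CDS {start}...{end} is not a whole number of codons")
    return length // 3


# (CDS start, CDS end, stated protein length) taken from the listing.
entries = {
    "SEQ ID NO:1 -> NO:2": (52, 1317, 422),
    "SEQ ID NO:3 -> NO:4": (88, 1362, 425),
}

results = {name: cds_codons(s, e) for name, (s, e, _) in entries.items()}
for name, (_, _, stated) in entries.items():
    print(name, results[name], "codons; stated length", stated)
```

In both cases the codon count equals the stated protein length, which
indicates that the LOCATION ranges exclude the stop codon.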

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 



Met Pro Ala Gly Arg Arg Gly Pro Ala Ala Gln Ser Ala Arg Arg Pro
1               5                   10                  15

Pro Pro Leu Leu Pro Leu Leu Leu Leu Leu Cys Val Leu Gly Ala Pro
            20                  25                  30

Arg Ala Gly Ser Gly Ala His Thr Ala Val Ile Ser Pro Gln Asp Pro
        35                  40                  45

Thr Leu Leu Ile Gly Ser Ser Leu Leu Ala Thr Cys Ser Val His Gly
    50                  55                  60

Asp Pro Pro Gly Ala Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn Gly
65                  70                  75                  80

Arg Arg Leu Pro Pro Glu Leu Ser Arg Val Leu Asn Ala Ser Thr Leu
                85                  90                  95

Ala Leu Ala Leu Ala Asn Leu Asn Gly Ser Arg Gln Arg Ser Gly Asp
            100                 105                 110

Asn Leu Val Cys His Ala Arg Asp Gly Ser Ile Leu Ala Gly Ser Cys
        115                 120                 125

Leu Tyr Val Gly Leu Pro Pro Glu Lys Pro Val Asn Ile Ser Cys Trp
    130                 135                 140

Ser Lys Asn Met Lys Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala His
145                 150                 155                 160

Gly Glu Thr Phe Leu His Thr Asn Tyr Ser Leu Lys Tyr Lys Leu Arg
                165                 170                 175

Trp Tyr Gly Gln Asp Asn Thr Cys Glu Glu Tyr His Thr Val Gly Pro
            180                 185                 190

His Ser Cys His Ile Pro Lys Asp Leu Ala Leu Phe Thr Pro Tyr Glu
        195                 200                 205

Ile Trp Val Glu Ala Thr Asn Arg Leu Gly Ser Ala Arg Ser Asp Val
    210                 215                 220

Leu Thr Leu Asp Ile Leu Asp Val Val Thr Thr Asp Pro Pro Pro Glu
225                 230                 235                 240

Val His Val Ser Arg Val Gly Gly Leu Glu Asp Gln Leu Ser Val Arg
                245                 250                 255

Trp Val Ser Pro Pro Ala Leu Lys Asp Phe Leu Phe Gln Ala Lys Tyr
            260                 265                 270

Gln Ile Arg Tyr Arg Val Glu Asp Ser Val Asp Trp Lys Val Val Asp
        275                 280                 285

Asp Val Ser Asn Gln Thr Ser Cys Arg Leu Ala Gly Leu Lys Pro Gly
    290                 295                 300

Thr Val Tyr Phe Val Gln Val Arg Cys Asn Pro Phe Gly Ile Tyr Gly
305                 310                 315                 320

Ser Lys Lys Ala Gly Ile Trp Ser Glu Trp Ser His Pro Thr Ala Ala
                325                 330                 335

Ser Thr Pro Arg Ser Glu Arg Pro Gly Pro Gly Gly Gly Ala Cys Glu
            340                 345                 350

Pro Arg Gly Gly Glu Pro Ser Ser Gly Pro Val Arg Arg Glu Leu Lys
        355                 360                 365

Gln Phe Leu Gly Trp Leu Lys Lys His Ala Tyr Cys Ser Asn Leu Ser
    370                 375                 380

Phe Arg Leu Tyr Asp Gln Trp Arg Ala Trp Met Gln Lys Ser His Lys
385                 390                 395                 400

Thr Arg Asn Gln Asp Glu Gly Ile Leu Pro Ser Gly Arg Arg Gly Thr
                405                 410                 415

Ala Arg Gly Pro Ala Arg
            420













(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1813 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence

(B) LOCATION: 88...1362
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

AATTCGGCAC GAGGGGCCTG CGTCCCGCGC CGTGCGCCAC CGCCGCCGAG CCGCAGCCCG  60
CCGCGCGCCC CCGGCAGCGC CGGCCCC ATG CCC GCC GGC CGC CGG GGC CCC GCC  114
                              Met Pro Ala Gly Arg Arg Gly Pro Ala
                              1               5

GCC CAA TCC GCG CGG CGG CCG CCG CCG TTG CTG CCC CTG CTG CTG CTC  162
Ala Gln Ser Ala Arg Arg Pro Pro Pro Leu Leu Pro Leu Leu Leu Leu
10                  15                  20                  25

TGC GTC CTC GGG GCG CCG CGA GCC GGA TCA GGA GCC CAC ACA GCT GTG  210
Cys Val Leu Gly Ala Pro Arg Ala Gly Ser Gly Ala His Thr Ala Val
                30                  35                  40

ATC AGT CCC CAG GAT CCC ACG CTT CTC ATC GGC TCC TCC CTG CTG GCC  258
Ile Ser Pro Gln Asp Pro Thr Leu Leu Ile Gly Ser Ser Leu Leu Ala
            45                  50                  55

ACC TGC TCA GTG CAC GGA GAC CCA CCA GGA GCC ACC GCC GAG GGC CTC  306
Thr Cys Ser Val His Gly Asp Pro Pro Gly Ala Thr Ala Glu Gly Leu
        60                  65                  70

TAC TGG ACC CTC AAT GGG CGC CGC CTG CCC CCT GAG CTC TCC CGT GTA  354
Tyr Trp Thr Leu Asn Gly Arg Arg Leu Pro Pro Glu Leu Ser Arg Val
    75                  80                  85

CTC AAC GCC TCC ACC TTG GCT CTG GCC CTG GCC AAC CTC AAT GGG TCC  402
Leu Asn Ala Ser Thr Leu Ala Leu Ala Leu Ala Asn Leu Asn Gly Ser
90                  95                  100                 105

AGG CAG CGG TCG GGG GAC AAC CTC GTG TGC CAC GCC CGT GAC GGC AGC  450
Arg Gln Arg Ser Gly Asp Asn Leu Val Cys His Ala Arg Asp Gly Ser
                110                 115                 120

ATC CTG GCT GGC TCC TGC CTC TAT GTT GGC CTG CCC CCA GAG AAA CCC  498
Ile Leu Ala Gly Ser Cys Leu Tyr Val Gly Leu Pro Pro Glu Lys Pro
            125                 130                 135

GTC AAC ATC AGC TGC TGG TCC AAG AAC ATG AAG GAC TTG ACC TGC CGC  546
Val Asn Ile Ser Cys Trp Ser Lys Asn Met Lys Asp Leu Thr Cys Arg
        140                 145                 150

TGG ACG CCA GGG GCC CAC GGG GAG ACC TTC CTC CAC ACC AAC TAC TCC  594
Trp Thr Pro Gly Ala His Gly Glu Thr Phe Leu His Thr Asn Tyr Ser
    155                 160                 165

CTC AAG TAC AAG CTT AGG TGG TAT GGC CAG GAC AAC ACA TGT GAG GAG  642
Leu Lys Tyr Lys Leu Arg Trp Tyr Gly Gln Asp Asn Thr Cys Glu Glu
170                 175                 180                 185

TAC CAC ACA GTG GGG CCC CAC TCC TGC CAC ATC CCC AAG GAC CTG GCT  690
Tyr His Thr Val Gly Pro His Ser Cys His Ile Pro Lys Asp Leu Ala
                190                 195                 200

CTC TTT ACG CCC TAT GAG ATC TGG GTG GAG GCC ACC AAC CGC CTG GGC  738
Leu Phe Thr Pro Tyr Glu Ile Trp Val Glu Ala Thr Asn Arg Leu Gly
            205                 210                 215

TCT GCC CGC TCC GAT GTA CTC ACG CTG GAT ATC CTG GAT GTG GTG ACC  786
Ser Ala Arg Ser Asp Val Leu Thr Leu Asp Ile Leu Asp Val Val Thr
        220                 225                 230

ACG GAC CCC CCG CCC GAC GTG CAC GTG AGC CGC GTC GGG GGC CTG GAG  834
Thr Asp Pro Pro Pro Asp Val His Val Ser Arg Val Gly Gly Leu Glu
    235                 240                 245

GAC CAG CTG AGC GTG CGC TGG GTG TCG CCA CCC GCC CTC AAG GAT TTC  882
Asp Gln Leu Ser Val Arg Trp Val Ser Pro Pro Ala Leu Lys Asp Phe
250                 255                 260                 265

CTC TTT CAA GCC AAA TAC CAG ATC CGC TAC CGA GTG GAG GAC AGT GTG  930
Leu Phe Gln Ala Lys Tyr Gln Ile Arg Tyr Arg Val Glu Asp Ser Val
                270                 275                 280

GAC TGG AAG GTG GTG GAC GAT GTG AGC AAC CAG ACC TCC TGC CGC CTG  978
Asp Trp Lys Val Val Asp Asp Val Ser Asn Gln Thr Ser Cys Arg Leu
            285                 290                 295

GCC GGC CTG AAA CCC GGC ACC GTG TAC TTC GTG CAA GTG CGC TGC AAC  1026
Ala Gly Leu Lys Pro Gly Thr Val Tyr Phe Val Gln Val Arg Cys Asn
        300                 305                 310

CCC TTT GGC ATC TAT GGC TCC AAG AAA GCC GGG ATC TGG AGT GAG TGG  1074
Pro Phe Gly Ile Tyr Gly Ser Lys Lys Ala Gly Ile Trp Ser Glu Trp
    315                 320                 325

AGC CAC CCC ACA GCC GCC TCC ACT CCC CGC AGT GAG CGC CCG GGC CCG  1122
Ser His Pro Thr Ala Ala Ser Thr Pro Arg Ser Glu Arg Pro Gly Pro
330                 335                 340                 345

GGC GGC GGG GCG TGC GAA CCG CGG GGC GGA GAG CCG AGC TCG GGG CCG  1170
Gly Gly Gly Ala Cys Glu Pro Arg Gly Gly Glu Pro Ser Ser Gly Pro
                350                 355                 360

GTG CGG CGC GAG CTC AAG CAG TTC CTG GGC TGG CTC AAG AAG CAC GCG  1218
Val Arg Arg Glu Leu Lys Gln Phe Leu Gly Trp Leu Lys Lys His Ala
            365                 370                 375

TAC TGC TCC AAC CTC AGC TTC CGC CTC TAC GAC CAG TGG CGA GCC TGG  1266
Tyr Cys Ser Asn Leu Ser Phe Arg Leu Tyr Asp Gln Trp Arg Ala Trp
        380                 385                 390

ATG CAG AAG TCG CAC AAG ACC CGC AAC CAG CAC AGG ACG AGG GGA TCC  1314
Met Gln Lys Ser His Lys Thr Arg Asn Gln His Arg Thr Arg Gly Ser
    395                 400                 405

TGC CCT CGG GCA GAC GGG GCA CGG CGA GAG GTC CTG CCA GAT AAG CTG T  1363
Cys Pro Arg Ala Asp Gly Ala Arg Arg Glu Val Leu Pro Asp Lys Leu
410                 415                 420                 425



AGGGGCTCAG GCCACCCTCC CTGCCACGTG GAGACGCAGA GGCCGAACCC AAACTGGGGC 1423 

CACCTCTGTA CCCTCACTTC AGGGCACCTG AGCCACCCTC AGCAGGAGCT GGGGTGGCCC 1483 

CTGAGCTCCA ACGGCCATAA CAGCTCTGAC TCCCACGTGA GGCCACCTTT GGGTGCACCC 1543 

CAGTGGGTGT GTGTGTGTGT GTGAGGGTTG GTTGAGTTGC CTAGAACCCC TGCCAGGGCT 1603 

GGGGGTGAGA AGGGGAGTCA TTACTCCCCA TTACCTAGGG CCCCTCCAAA AGAGTCCTTT 1663 

TAAATAAATG AGCTATTTAG GTGCAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1723 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1783 

AAAAAAAAAA AAAAAAAAAT TTCCCGGGGA 1813 

(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



Met Pro Ala Gly Arg Arg Gly Pro Ala Ala Gln Ser Ala Arg Arg Pro
1               5                   10                  15

Pro Pro Leu Leu Pro Leu Leu Leu Leu Cys Val Leu Gly Ala Pro Arg
            20                  25                  30

Ala Gly Ser Gly Ala His Thr Ala Val Ile Ser Pro Gln Asp Pro Thr
        35                  40                  45

Leu Leu Ile Gly Ser Ser Leu Leu Ala Thr Cys Ser Val His Gly Asp
    50                  55                  60

Pro Pro Gly Ala Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn Gly Arg
65                  70                  75                  80

Arg Leu Pro Pro Glu Leu Ser Arg Val Leu Asn Ala Ser Thr Leu Ala
                85                  90                  95

Leu Ala Leu Ala Asn Leu Asn Gly Ser Arg Gln Arg Ser Gly Asp Asn
            100                 105                 110

Leu Val Cys His Ala Arg Asp Gly Ser Ile Leu Ala Gly Ser Cys Leu
        115                 120                 125

Tyr Val Gly Leu Pro Pro Glu Lys Pro Val Asn Ile Ser Cys Trp Ser
    130                 135                 140

Lys Asn Met Lys Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala His Gly
145                 150                 155                 160

Glu Thr Phe Leu His Thr Asn Tyr Ser Leu Lys Tyr Lys Leu Arg Trp
                165                 170                 175

Tyr Gly Gln Asp Asn Thr Cys Glu Glu Tyr His Thr Val Gly Pro His
            180                 185                 190

Ser Cys His Ile Pro Lys Asp Leu Ala Leu Phe Thr Pro Tyr Glu Ile
        195                 200                 205

Trp Val Glu Ala Thr Asn Arg Leu Gly Ser Ala Arg Ser Asp Val Leu
    210                 215                 220

Thr Leu Asp Ile Leu Asp Val Val Thr Thr Asp Pro Pro Pro Asp Val
225                 230                 235                 240

His Val Ser Arg Val Gly Gly Leu Glu Asp Gln Leu Ser Val Arg Trp
                245                 250                 255

Val Ser Pro Pro Ala Leu Lys Asp Phe Leu Phe Gln Ala Lys Tyr Gln
            260                 265                 270


He Arg 


Tyr 


Arg 


Val 


Glu 


Asp 


Ser 


Val 


Asp 


Trp 


Lys Val Val Asp Asp 




275 










280 








285 


Val Ser 


Asn 


Gin 


Thr 


Ser 


Cys 


Arg 


Leu 


Ala 


Gly 


Leu Lys Pro Gly Thr 


290 










295 










300 


Val Tyr 


Phe 


Val 


Gin 


Val 


Arg 


Cys 


Asn 


Pro 


Phe 


Gly He Tyr Gly Ser 


305 








310 










315 


320 
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1 wr 


Lys 


Ala 


Gly 


I le 


Trp 


Ser 


Glu 


Trp 


Ser His Pro Thr Ala 


Ala Ser 




















O H A 

330 


335 


i nr 


rPO 


Arg 


Ser 


Glu 


Arg 


Pro 


Gly 


Pro 


Gly Gly Gly Ala Cys 


Glu Pro 


















^ AC 

345 


350 




Arg 


Gly 


Gly 


Glu 


Pro 


Ser 


Ser 


Gly 


Pro 


Val Arg Arg Glu Leu 


Lys Gin 






355 










360 




365 


Phe 


Leu 


Gly 


Trp 


Leu 


Lys 


Lys 


His 


Ala 


Tyr Cys Ser Asn Leu 


Ser Phe 




370 










375 






380 




Arg 


Leu 


Tyr 


Asp 


Gin 


Trp 


Arg 


Ala 


Trp 


Met Gin Lys Ser His 


Lys Thr 


385 










390 








395 


400 


Arg 


Asn 


Gin 


His 


Arg 


Thr 


Arg 


Gly 


Ser 


Cys Pro Arg Ala Asp 


Gly Ala 










405 










410 


415 


Arg 


Arg 


Glu 


Val 


Leu 


Pro 


Asp 


Lys 


Leu 












420 










425 







(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1724 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 159... 1433 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

GAATTCGGCA CGAGGAATTT CGGCTGCTCA GACTTGCTCC GGCCTTCGCT GTCCGCGCCC 60 
AGTGACGTGC GTGCGGACCC AAACCCCAAT CTGCACCCCG CAGAGTCGCC CCCGCCCCAT 120 
ACCGGCGTTG CAGTCACCGC CCGTTGCGCG CCACCCCC ATG CCC GCC GGT GGC CCG 176 

Met Pro Ala Gly Gly Pro 
1 5 

GGC CCC GCC GCC CAA TCC GCG CGG CGG CCG CCG CGG CGG CTC TCC TCG 224 
Gly Pro Ala Ala Gln Ser Ala Arg Arg Pro Pro Arg Arg Leu Ser Ser
10 15 20 

CTG TGG TCG CCT CTG TTG CTC TGT GTT CTC GGG GTG CCT CAG GGC GGA 272 
Leu Trp Ser Pro Leu Leu Leu Cys Val Leu Gly Val Pro Gln Gly Gly
25 30 35

TCG GGA GCC CAC ACA GCT GTG ATC AGT CCC CAG GAC CCC ACT CTT CTC 320
Ser Gly Ala His Thr Ala Val Ile Ser Pro Gln Asp Pro Thr Leu Leu
40 45 50

ATC GGA TCC TCC CTT CAT GCT ACG TGC TCT ATA CAT GGA GAC ACA CCG 368 

Ile Gly Ser Ser Leu His Ala Thr Cys Ser Ile His Gly Asp Thr Pro
55 60 65 70 

GGG GCC ACT GCT GAG GGC CTC TAC TGG ACC CTC AAC GGC CGC CGC CTG 416 

Gly Ala Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn Gly Arg Arg Leu 
75 80 85 

CCC TCA GAG CTG TCC CGT CTC CTC AAC ACC TCC ACC CTG GCC TTG GCC 464 

Pro Ser Glu Leu Ser Arg Leu Leu Asn Thr Ser Thr Leu Ala Leu Ala 
90 95 100 

CTG GCT AAC CTT AAT GGG TCC AGG CAG CAG TCA GGG GAC AAT CTG GTG 512 

Leu Ala Asn Leu Asn Gly Ser Arg Gln Gln Ser Gly Asp Asn Leu Val
105 110 115

TGT CAC GCC CGA GAT GGC AGC ATT CTG GCT GGT TCC TGC CTC TAT GTT 560 

Cys His Ala Arg Asp Gly Ser Ile Leu Ala Gly Ser Cys Leu Tyr Val

120 125 130 

GGT CTG CCC CCG GAG AAG CCC TTT AAC ATC AGC TGC TGG TCC CGG AAC 608 

Gly Leu Pro Pro Glu Lys Pro Phe Asn Ile Ser Cys Trp Ser Arg Asn
135 140 145 150 

ATG AAG GAC CTG ACA TGC CGT TGG ACA CCG GGT GCA CAT GGG GAG ACA 656 

Met Lys Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala His Gly Glu Thr 
155 160 165 

TTC CTA CAC ACC AAC TAC TCC CTC AAG TAC AAG CTG AGG TGG TAT GGT 704 

Phe Leu His Thr Asn Tyr Ser Leu Lys Tyr Lys Leu Arg Trp Tyr Gly 
170 175 180 

CAG GAC AAC ACA TGT GAG GAA TAT CAC ACT GTG GGC CCT CAC TCG TGC 752 

Gln Asp Asn Thr Cys Glu Glu Tyr His Thr Val Gly Pro His Ser Cys
185 190 195 

CAT ATC CCC AAA GAC CTG GCC CTC TTC ACG CCC TAT GAG ATC TGG GTG 800 

His Ile Pro Lys Asp Leu Ala Leu Phe Thr Pro Tyr Glu Ile Trp Val

200 205 210

GAA GCC ACC AAT CGC CTG GGT TCA GCG AGA TCT GAC GTG CTC ACA CTG 848
Glu Ala Thr Asn Arg Leu Gly Ser Ala Arg Ser Asp Val Leu Thr Leu 
215 220 225 230 

GAT GTC CTG GAC GTG GTG ACC ACG GAC CCT CCA CCC GAC GTG CAC GTG 896 
Asp Val Leu Asp Val Val Thr Thr Asp Pro Pro Pro Asp Val His Val 
235 240 245 

AGC CGC GTT GGG GGC CTG GAG GAC CAG CTG AGT GTG CGC TGG GTC TCA 944 
Ser Arg Val Gly Gly Leu Glu Asp Gln Leu Ser Val Arg Trp Val Ser
250 255 260 

CCA CCA GCT CTC AAG GAT TTC CTC TTC CAA GCC AAA TAC CAG ATT CGC 992 
Pro Pro Ala Leu Lys Asp Phe Leu Phe Gln Ala Lys Tyr Gln Ile Arg
265 270 275 

TAC CGC GTG GAG GAC AGC GTG GAC TGG AAG GTG GTG GAT GAC GTC AGC 1040 
Tyr Arg Val Glu Asp Ser Val Asp Trp Lys Val Val Asp Asp Val Ser 
280 285 290 

AAC CAG ACC TCC TGC CGT CTC GCG GGC TTG AAG CCC GGC ACC GTT TAC 1088 
Asn Gln Thr Ser Cys Arg Leu Ala Gly Leu Lys Pro Gly Thr Val Tyr
295 300 305 310 

TTC GTC CAA GTT CGT TGT AAC CCA TTC GGG ATC TAT GGG TCG AAA AAG 1136 
Phe Val Gln Val Arg Cys Asn Pro Phe Gly Ile Tyr Gly Ser Lys Lys
315 320 325 

GCG GGA ATC TGG AGC GAG TGG AGC CAC CCC ACC GCT GCC TCC ACC CCT 1184 
Ala Gly Ile Trp Ser Glu Trp Ser His Pro Thr Ala Ala Ser Thr Pro
330 335 340 

CGA AGT GAG CGC CCG GGC CCG GGC GGC GGG GTG TGC GAG CCG CGG GGC 1232 
Arg Ser Glu Arg Pro Gly Pro Gly Gly Gly Val Cys Glu Pro Arg Gly 
345 350 355 

GGC GAG CCT AGC TCG GGC CCG GTG CGG CGC GAG CTC AAG CAG TTC CTC 1280 
Gly Glu Pro Ser Ser Gly Pro Val Arg Arg Glu Leu Lys Gln Phe Leu
360 365 370 

GGC TGG CTC AAG AAG CAC GCG TAC TGC TCG AAC CTT AGC TTC CGC CTG 1328 
Gly Trp Leu Lys Lys His Ala Tyr Cys Ser Asn Leu Ser Phe Arg Leu 
375 380 385 390 






TAC GAC CAG TGG CGT GCT TGG ATG CAG AAG TCA CAC AAG ACC CGA AAC 1376
Tyr Asp Gln Trp Arg Ala Trp Met Gln Lys Ser His Lys Thr Arg Asn
395 400 405



CAG GAC GAG GGG ATC CTG CCC TCG GGC AGA CGG GGT GCG GCG AGA GGT 1424 
Gln Asp Glu Gly Ile Leu Pro Ser Gly Arg Arg Gly Ala Ala Arg Gly
410 415 420 

CCT GCC GGC TAAACTCTGA GGATAGGCCA TCCTCCTGCT GGATGCAGAC CTGGAGGCT 1482 
Pro Ala Gly 
425 



CACCTGAACT GGAGACCATC TGTACTGTCA CTTTGGGGCA ATGAAGAAAC AAACCAGGGG 1542 

CTGGGGCACA ATGAGCTCCC ACAACCACAG CTTTGGCCAC ATGATGGTCA ACTTTGGATG 1602 

TACCCCAATA TGGGTAGGGT TGGAGTAATG ACAAGGGTTA TGCAGGACCC TCCAAGAGTC 1662 

TCTTTGAATA AATAAGAAAA GAGTTGTTCA GGAAAAAAAA AAAAAAAAAA AATAGCGGCC 1722 

GC 1724 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Met Pro Ala Gly Gly Pro Gly Pro Ala Ala Gln Ser Ala Arg Arg Pro
1 5 10 15

Pro Arg Arg Leu Ser Ser Leu Trp Ser Pro Leu Leu Leu Cys Val Leu
20 25 30

Gly Val Pro Gln Gly Gly Ser Gly Ala His Thr Ala Val Ile Ser Pro
35 40 45

Gln Asp Pro Thr Leu Leu Ile Gly Ser Ser Leu His Ala Thr Cys Ser
50 55 60

Ile His Gly Asp Thr Pro Gly Ala Thr Ala Glu Gly Leu Tyr Trp Thr
65 70 75 80

Leu Asn Gly Arg Arg Leu Pro Ser Glu Leu Ser Arg Leu Leu Asn Thr
85 90 95

Ser Thr Leu Ala Leu Ala Leu Ala Asn Leu Asn Gly Ser Arg Gln Gln
100 105 110

Ser Gly Asp Asn Leu Val Cys His Ala Arg Asp Gly Ser Ile Leu Ala
115 120 125

Gly Ser Cys Leu Tyr Val Gly Leu Pro Pro Glu Lys Pro Phe Asn Ile
130 135 140

Ser Cys Trp Ser Arg Asn Met Lys Asp Leu Thr Cys Arg Trp Thr Pro
145 150 155 160

Gly Ala His Gly Glu Thr Phe Leu His Thr Asn Tyr Ser Leu Lys Tyr
165 170 175

Lys Leu Arg Trp Tyr Gly Gln Asp Asn Thr Cys Glu Glu Tyr His Thr
180 185 190

Val Gly Pro His Ser Cys His Ile Pro Lys Asp Leu Ala Leu Phe Thr
195 200 205

Pro Tyr Glu Ile Trp Val Glu Ala Thr Asn Arg Leu Gly Ser Ala Arg
210 215 220

Ser Asp Val Leu Thr Leu Asp Val Leu Asp Val Val Thr Thr Asp Pro
225 230 235 240

Pro Pro Asp Val His Val Ser Arg Val Gly Gly Leu Glu Asp Gln Leu
245 250 255

Ser Val Arg Trp Val Ser Pro Pro Ala Leu Lys Asp Phe Leu Phe Gln
260 265 270

Ala Lys Tyr Gln Ile Arg Tyr Arg Val Glu Asp Ser Val Asp Trp Lys
275 280 285

Val Val Asp Asp Val Ser Asn Gln Thr Ser Cys Arg Leu Ala Gly Leu
290 295 300

Lys Pro Gly Thr Val Tyr Phe Val Gln Val Arg Cys Asn Pro Phe Gly
305 310 315 320

Ile Tyr Gly Ser Lys Lys Ala Gly Ile Trp Ser Glu Trp Ser His Pro
325 330 335

Thr Ala Ala Ser Thr Pro Arg Ser Glu Arg Pro Gly Pro Gly Gly Gly
340 345 350

Val Cys Glu Pro Arg Gly Gly Glu Pro Ser Ser Gly Pro Val Arg Arg
355 360 365

Glu Leu Lys Gln Phe Leu Gly Trp Leu Lys Lys His Ala Tyr Cys Ser
370 375 380

Asn Leu Ser Phe Arg Leu Tyr Asp Gln Trp Arg Ala Trp Met Gln Lys
385 390 395 400

Ser His Lys Thr Arg Asn Gln Asp Glu Gly Ile Leu Pro Ser Gly Arg
405 410 415

Arg Gly Ala Ala Arg Gly Pro Ala Gly
420 425



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 259 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

AGGATTTCCT CTTTCAAGCC AAATACCAGA TCCGCTANCG AGTGGAGGAN AGTGTGGANT 60 

GGAAGGTGGT GGANGATGTG AGCAACCAGA CCTTCTGCCG CTGGNCGGCC TGAAACCCGG 120 

CANCGTGTAC TTCGTGCAAG TGCGCTGCAA NCCCTTTGGC ATCTATGGCT NCAAGAAAGC 180 

CGGGATCTNG AGTGAGTGGA GCCANCCCAC AGCCGGCTTC ANTTCCCGCA GTGAGCGNCN 240 

GGGCCCGGGN GGNGGGAAG 259 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
CGCGAGCTCA AGCAGTTCCT G 21
(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

GNACACGCCG NTATAGCTNG CCCCTGCTGC TGCTGCTCTG CGTCCTCGGG GCNCGCGAGC 60 

GGATTCAGGA GCCCACACAG CTGTGATCAG TCCCCAGGAT CCCACGCTTC TCATCGGCTC 120 

CTCCCTGCTG GCCACCTGCT CAGTGCACGG AGACCCACCA GGAGCCACCG CCGAGGGCCT 180 

CTACTGGACC CTCAACGGGC GCCGCTGCCC 210

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
CCACCTAAGC TTGTACTTGA GG 22 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
AGCCCACACA GCTGTGATCA G 21 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
GACAGAGCAC AGAATTCACT AGTGAGCTCT TTTTTTTTTT TTTT 44
(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
TATGGCCAGG ACAACACA 18

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
ATAGGGCGTA AAGAGAGC 18

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
TCACATCGTC CACCACCTTC CAGTCCA 27

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Other 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GAGCACAGAA TTCACTACTC GAGGCGGCCG CTTTTTTTTT TTTTTTTTT 49
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



Gly Ser Gly Ala His Thr Ala Val Ile Ser Pro Gln Asp Pro Thr Leu
1 5 10 15

Leu Ile Gly Ser Ser Leu Leu Ala Thr Cys Ser Val His Gly Asp Pro
20 25 30

Pro Gly Ala Thr Ala Glu Gly Leu Tyr Trp Thr Leu Xaa Gly Arg Arg
35 40 45

Leu Pro Pro Glu Leu Ser Arg Val Leu Asn Ala Ser Thr Leu Ala Leu
50 55 60

Ala Leu Ala Asn Leu Asn Gly Ser Arg Gln Arg Ser Gly Asp Asn Leu
65 70 75 80

Val Cys His Ala Arg Asp Gly Ser Ile Leu Ala Gly Ser Cys Leu Tyr
85 90 95

Val Gly Leu Pro Pro Glu Lys Pro Val Asn Ile Ser Cys Trp Ser Lys
100 105 110

Asn Met Lys Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala His Gly Glu
115 120 125

Thr Phe Leu His Thr Asn Tyr Ser Leu Lys Tyr Lys Leu Arg Trp Tyr
130 135 140

Gly Gln Asp Asn Thr Cys Glu Glu Tyr His Thr Val Gly Pro His Ser
145 150 155 160

Cys His Ile Pro Lys Asp Leu Ala Leu Phe Thr Pro Tyr Glu Ile Trp
165 170 175

Val Glu Ala Thr Asn Arg Leu Gly Ser Ala Arg Ser Asp Val Leu Thr
180 185 190

Leu Asp Ile Leu Asp Val Val Thr Thr Asp Pro Pro Pro Glu Val His
195 200 205

Val Ser Arg Val Gly Gly Leu Glu Asp Gln Leu Ser Val Arg Trp Val
210 215 220

Ser Pro Pro Ala Leu Lys Asp Phe Leu Phe Gln Ala Lys Tyr Gln Ile
225 230 235 240

Arg Tyr Arg Val Glu Asp Ser Val Asp Trp Lys Val Val Asp Asp Val
245 250 255

Ser Asn Gln Thr Ser Cys Arg Leu Ala Gly Leu Lys Pro Gly Thr Val
260 265 270

Tyr Phe Val Gln Val Arg Cys Asn Pro Phe Gly Ile Tyr Gly Ser Lys
275 280 285

Lys Ala Gly Ile Trp Ser Glu Trp Ser His Pro Thr Ala Ala Ser Thr
290 295 300

Pro Arg Ser Glu Arg Pro Gly Pro Gly Gly Gly Ala Cys Glu Pro Arg
305 310 315 320

Gly Gly Glu Pro Ser Ser Gly Pro Val Arg Arg Glu Leu Lys Gln Phe
325 330 335

Leu Gly Trp Leu Lys Lys His Ala Tyr Cys Ser Asn Leu Ser Phe Arg
340 345 350

Leu Tyr Asp Gln Trp Arg Ala Trp Met Gln Lys Ser His Lys Thr Arg
355 360 365

Asn Gln Asp Glu Gly Ile Leu Pro Ser Gly Arg Arg Gly Thr Ala Arg
370 375 380

Gly Pro Ala Arg
385



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 392 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Gly Ser Gly Ala His Thr Ala Val Ile Ser Pro Gln Asp Pro Thr Leu
1 5 10 15

Leu Ile Gly Ser Ser Leu Leu Ala Thr Cys Ser Val His Gly Asp Pro
20 25 30

Pro Gly Ala Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn Gly Arg Arg
35 40 45

Leu Pro Pro Glu Leu Ser Arg Val Leu Asn Ala Ser Thr Leu Ala Leu
50 55 60

Ala Leu Ala Asn Leu Asn Gly Ser Arg Gln Arg Ser Gly Asp Asn Leu
65 70 75 80

Val Cys His Ala Arg Asp Gly Ser Ile Leu Ala Gly Ser Cys Leu Tyr
85 90 95

Val Gly Leu Pro Pro Glu Lys Pro Val Asn Ile Ser Cys Trp Ser Lys
100 105 110

Asn Met Lys Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala His Gly Glu
115 120 125

Thr Phe Leu His Thr Asn Tyr Ser Leu Lys Tyr Lys Leu Arg Trp Tyr
130 135 140

Gly Gln Asp Asn Thr Cys Glu Glu Tyr His Thr Val Gly Pro His Ser
145 150 155 160

Cys His Ile Pro Lys Asp Leu Ala Leu Phe Thr Pro Tyr Glu Ile Trp
165 170 175

Val Glu Ala Thr Asn Arg Leu Gly Ser Ala Arg Ser Asp Val Leu Thr
180 185 190

Leu Asp Ile Leu Asp Val Val Thr Thr Asp Pro Pro Pro Asp Val His
195 200 205

Val Ser Arg Val Gly Gly Leu Glu Asp Gln Leu Ser Val Arg Trp Val
210 215 220

Ser Pro Pro Ala Leu Lys Asp Phe Leu Phe Gln Ala Lys Tyr Gln Ile
225 230 235 240

Arg Tyr Arg Val Glu Asp Ser Val Asp Trp Lys Val Val Asp Asp Val
245 250 255

Ser Asn Gln Thr Ser Cys Arg Leu Ala Gly Leu Lys Pro Gly Thr Val
260 265 270

Tyr Phe Val Gln Val Arg Cys Asn Pro Phe Gly Ile Tyr Gly Ser Lys
275 280 285

Lys Ala Gly Ile Trp Ser Glu Trp Ser His Pro Thr Ala Ala Ser Thr
290 295 300

Pro Arg Ser Glu Arg Pro Gly Pro Gly Gly Gly Ala Cys Glu Pro Arg
305 310 315 320

Gly Gly Glu Pro Ser Ser Gly Pro Val Arg Arg Glu Leu Lys Gln Phe
325 330 335

Leu Gly Trp Leu Lys Lys His Ala Tyr Cys Ser Asn Leu Ser Phe Arg
340 345 350

Leu Tyr Asp Gln Trp Arg Ala Trp Met Gln Lys Ser His Lys Thr Arg
355 360 365

Asn Gln His Arg Thr Arg Gly Ser Cys Pro Arg Ala Asp Gly Ala Arg
370 375 380

Arg Glu Val Leu Pro Asp Lys Leu
385 390

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 385 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 





(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

Ala His Thr Ala Val Ile Ser Pro Gln Asp Pro Thr Leu Leu Ile Gly
1 5 10 15

Ser Ser Leu His Ala Thr Cys Ser Ile His Gly Asp Thr Pro Gly Ala
20 25 30

Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn Gly Arg Arg Leu Pro Ser
35 40 45

Glu Leu Ser Arg Leu Leu Asn Thr Ser Thr Leu Ala Leu Ala Leu Ala
50 55 60

Asn Leu Asn Gly Ser Arg Gln Gln Ser Gly Asp Asn Leu Val Cys His
65 70 75 80

Ala Arg Asp Gly Ser Ile Leu Ala Gly Ser Cys Leu Tyr Val Gly Leu
85 90 95

Pro Pro Glu Lys Pro Phe Asn Ile Ser Cys Trp Ser Arg Asn Met Lys
100 105 110

Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala His Gly Glu Thr Phe Leu
115 120 125

His Thr Asn Tyr Ser Leu Lys Tyr Lys Leu Arg Trp Tyr Gly Gln Asp
130 135 140

Asn Thr Cys Glu Glu Tyr His Thr Val Gly Pro His Ser Cys His Ile
145 150 155 160

Pro Lys Asp Leu Ala Leu Phe Thr Pro Tyr Glu Ile Trp Val Glu Ala
165 170 175

Thr Asn Arg Leu Gly Ser Ala Arg Ser Asp Val Leu Thr Leu Asp Val
180 185 190

Leu Asp Val Val Thr Thr Asp Pro Pro Pro Asp Val His Val Ser Arg
195 200 205

Val Gly Gly Leu Glu Asp Gln Leu Ser Val Arg Trp Val Ser Pro Pro
210 215 220

Ala Leu Lys Asp Phe Leu Phe Gln Ala Lys Tyr Gln Ile Arg Tyr Arg
225 230 235 240

Val Glu Asp Ser Val Asp Trp Lys Val Val Asp Asp Val Ser Asn Gln
245 250 255

Thr Ser Cys Arg Leu Ala Gly Leu Lys Pro Gly Thr Val Tyr Phe Val
260 265 270

Gln Val Arg Cys Asn Pro Phe Gly Ile Tyr Gly Ser Lys Lys Ala Gly
275 280 285

Ile Trp Ser Glu Trp Ser His Pro Thr Ala Ala Ser Thr Pro Arg Ser
290 295 300

Glu Arg Pro Gly Pro Gly Gly Gly Val Cys Glu Pro Arg Gly Gly Glu
305 310 315 320

Pro Ser Ser Gly Pro Val Arg Arg Glu Leu Lys Gln Phe Leu Gly Trp
325 330 335

Leu Lys Lys His Ala Tyr Cys Ser Asn Leu Ser Phe Arg Leu Tyr Asp
340 345 350

Gln Trp Arg Ala Trp Met Gln Lys Ser His Lys Thr Arg Asn Gln Asp
355 360 365

Glu Gly Ile Leu Pro Ser Gly Arg Arg Gly Ala Ala Arg Gly Pro Ala
370 375 380

Gly
385

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 385 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:



Ala His Thr Ala Val Ile Ser Pro Gln Asp Pro Thr Leu Leu Ile Gly
1 5 10 15

Ser Ser Leu Leu Ala Thr Cys Ser Val His Gly Asp Pro Pro Gly Ala
20 25 30

Thr Ala Glu Gly Leu Tyr Trp Thr Leu Xaa Gly Arg Arg Leu Pro Pro
35 40 45

Glu Leu Ser Arg Val Leu Asn Ala Ser Thr Leu Ala Leu Ala Leu Ala
50 55 60

Asn Leu Asn Gly Ser Arg Gln Arg Ser Gly Asp Asn Leu Val Cys His
65 70 75 80

Ala Arg Asp Gly Ser Ile Leu Ala Gly Ser Cys Leu Tyr Val Gly Leu
85 90 95

Pro Pro Glu Lys Pro Val Asn Ile Ser Cys Trp Ser Lys Asn Met Lys
100 105 110

Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala His Gly Glu Thr Phe Leu
115 120 125

His Thr Asn Tyr Ser Leu Lys Tyr Lys Leu Arg Trp Tyr Gly Gln Asp
130 135 140

Asn Thr Cys Glu Glu Tyr His Thr Val Gly Pro His Ser Cys His Ile
145 150 155 160

Pro Lys Asp Leu Ala Leu Phe Thr Pro Tyr Glu Ile Trp Val Glu Ala
165 170 175

Thr Asn Arg Leu Gly Ser Ala Arg Ser Asp Val Leu Thr Leu Asp Ile
180 185 190

Leu Asp Val Val Thr Thr Asp Pro Pro Pro Glu Val His Val Ser Arg
195 200 205

Val Gly Gly Leu Glu Asp Gln Leu Ser Val Arg Trp Val Ser Pro Pro
210 215 220

Ala Leu Lys Asp Phe Leu Phe Gln Ala Lys Tyr Gln Ile Arg Tyr Arg
225 230 235 240

Val Glu Asp Ser Val Asp Trp Lys Val Val Asp Asp Val Ser Asn Gln
245 250 255

Thr Ser Cys Arg Leu Ala Gly Leu Lys Pro Gly Thr Val Tyr Phe Val
260 265 270

Gln Val Arg Cys Asn Pro Phe Gly Ile Tyr Gly Ser Lys Lys Ala Gly
275 280 285

Ile Trp Ser Glu Trp Ser His Pro Thr Ala Ala Ser Thr Pro Arg Ser
290 295 300

Glu Arg Pro Gly Pro Gly Gly Gly Ala Cys Glu Pro Arg Gly Gly Glu
305 310 315 320

Pro Ser Ser Gly Pro Val Arg Arg Glu Leu Lys Gln Phe Leu Gly Trp
325 330 335

Leu Lys Lys His Ala Tyr Cys Ser Asn Leu Ser Phe Arg Leu Tyr Asp
340 345 350

Gln Trp Arg Ala Trp Met Gln Lys Ser His Lys Thr Arg Asn Gln Asp
355 360 365

Glu Gly Ile Leu Pro Ser Gly Arg Arg Gly Thr Ala Arg Gly Pro Ala
370 375 380

Arg
385

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 303 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:



Ala His Thr Ala Val Ile Ser Pro Gln Asp Pro Thr Leu Leu Ile Gly
1 5 10 15

Ser Ser Leu Leu Ala Thr Cys Ser Val His Gly Asp Pro Pro Gly Ala
20 25 30

Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn Gly Arg Arg Leu Pro Pro
35 40 45

Glu Leu Ser Arg Val Leu Asn Ala Ser Thr Leu Ala Leu Ala Leu Ala
50 55 60

Asn Leu Asn Gly Ser Arg Gln Arg Ser Gly Asp Asn Leu Val Cys His
65 70 75 80

Ala Arg Asp Gly Ser Ile Leu Ala Gly Ser Cys Leu Tyr Val Gly Leu
85 90 95

Pro Pro Glu Lys Pro Val Asn Ile Ser Cys Trp Ser Lys Asn Met Lys
100 105 110

Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala His Gly Glu Thr Phe Leu
115 120 125

His Thr Asn Tyr Ser Leu Lys Tyr Lys Leu Arg Trp Tyr Gly Gln Asp
130 135 140

Asn Thr Cys Glu Glu Tyr His Thr Val Gly Pro His Ser Cys His Ile
145 150 155 160

Pro Lys Asp Leu Ala Leu Phe Thr Pro Tyr Glu Ile Trp Val Glu Ala
165 170 175

Thr Asn Arg Leu Gly Ser Ala Arg Ser Asp Val Leu Thr Leu Asp Ile
180 185 190

Leu Asp Val Val Thr Thr Asp Pro Pro Pro Glu Val His Val Ser Arg
195 200 205

Val Gly Gly Leu Glu Asp Gln Leu Ser Val Arg Trp Val Ser Pro Pro
210 215 220

Ala Leu Lys Asp Phe Leu Phe Gln Ala Lys Tyr Gln Ile Arg Tyr Arg
225 230 235 240

Val Glu Asp Ser Val Asp Trp Lys Val Val Asp Asp Val Ser Asn Gln
245 250 255

Thr Ser Cys Arg Leu Ala Gly Leu Lys Pro Gly Thr Val Tyr Phe Val
260 265 270

Gln Val Arg Cys Asn Pro Phe Gly Ile Tyr Gly Ser Lys Lys Ala Gly
275 280 285

Ile Trp Ser Glu Trp Ser His Pro Thr Ala Ala Ser Thr Pro Arg
290 295 300



(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 389 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:




Ala His Thr Ala Val Ile Ser Pro Gln Asp Pro Thr Leu Leu Ile Gly
1               5                   10                  15

Ser Ser Leu Leu Ala Thr Cys Ser Val His Gly Asp Pro Pro Gly Ala
            20                  25                  30

Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn Gly Arg Arg Leu Pro Pro
        35                  40                  45

Glu Leu Ser Arg Val Leu Asn Ala Ser Thr Leu Ala Leu Ala Leu Ala
    50                  55                  60

Asn Leu Asn Gly Ser Arg Gln Arg Ser Gly Asp Asn Leu Val Cys His
65                  70                  75                  80

Ala Arg Asp Gly Ser Ile Leu Ala Gly Ser Cys Leu Tyr Val Gly Leu
                85                  90                  95

Pro Pro Glu Lys Pro Val Asn Ile Ser Cys Trp Ser Lys Asn Met Lys
            100                 105                 110

Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala His Gly Glu Thr Phe Leu
        115                 120                 125

His Thr Asn Tyr Ser Leu Lys Tyr Lys Leu Arg Trp Tyr Gly Gln Asp
    130                 135                 140

Asn Thr Cys Glu Glu Tyr His Thr Val Gly Pro His Ser Cys His Ile
145                 150                 155                 160

Pro Lys Asp Leu Ala Leu Phe Thr Pro Tyr Glu Ile Trp Val Glu Ala
                165                 170                 175

Thr Asn Arg Leu Gly Ser Ala Arg Ser Asp Val Leu Thr Leu Asp Ile
            180                 185                 190

Leu Asp Val Val Thr Thr Asp Pro Pro Pro Asp Val His Val Ser Arg
        195                 200                 205

Val Gly Gly Leu Glu Asp Gln Leu Ser Val Arg Trp Val Ser Pro Pro
    210                 215                 220

Ala Leu Lys Asp Phe Leu Phe Gln Ala Lys Tyr Gln Ile Arg Tyr Arg
225                 230                 235                 240

Val Glu Asp Ser Val Asp Trp Lys Val Val Asp Asp Val Ser Asn Gln
                245                 250                 255

Thr Ser Cys Arg Leu Ala Gly Leu Lys Pro Gly Thr Val Tyr Phe Val
            260                 265                 270

Gln Val Arg Cys Asn Pro Phe Gly Ile Tyr Gly Ser Lys Lys Ala Gly
        275                 280                 285

Ile Trp Ser Glu Trp Ser His Pro Thr Ala Ala Ser Thr Pro Arg Ser
    290                 295                 300

Glu Arg Pro Gly Pro Gly Gly Gly Ala Cys Glu Pro Arg Gly Gly Glu
305                 310                 315                 320

Pro Ser Ser Gly Pro Val Arg Arg Glu Leu Lys Gln Phe Leu Gly Trp
                325                 330                 335

Leu Lys Lys His Ala Tyr Cys Ser Asn Leu Ser Phe Arg Leu Tyr Asp
            340                 345                 350

Gln Trp Arg Ala Trp Met Gln Lys Ser His Lys Thr Arg Asn Gln His
        355                 360                 365

Arg Thr Arg Gly Ser Cys Pro Arg Ala Asp Gly Ala Arg Arg Glu Val
    370                 375                 380

Leu Pro Asp Lys Leu
385




















(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 303 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:




Ala His Thr Ala Val Ile Ser Pro Gln Asp Pro Thr Leu Leu Ile Gly
1               5                   10                  15

Ser Ser Leu Leu Ala Thr Cys Ser Val His Gly Asp Pro Pro Gly Ala
            20                  25                  30

Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn Gly Arg Arg Leu Pro Pro
        35                  40                  45

Glu Leu Ser Arg Val Leu Asn Ala Ser Thr Leu Ala Leu Ala Leu Ala
    50                  55                  60

Asn Leu Asn Gly Ser Arg Gln Arg Ser Gly Asp Asn Leu Val Cys His
65                  70                  75                  80

Ala Arg Asp Gly Ser Ile Leu Ala Gly Ser Cys Leu Tyr Val Gly Leu
                85                  90                  95

Pro Pro Glu Lys Pro Val Asn Ile Ser Cys Trp Ser Lys Asn Met Lys
            100                 105                 110

Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala His Gly Glu Thr Phe Leu
        115                 120                 125

His Thr Asn Tyr Ser Leu Lys Tyr Lys Leu Arg Trp Tyr Gly Gln Asp
    130                 135                 140

Asn Thr Cys Glu Glu Tyr His Thr Val Gly Pro His Ser Cys His Ile
145                 150                 155                 160

Pro Lys Asp Leu Ala Leu Phe Thr Pro Tyr Glu Ile Trp Val Glu Ala
                165                 170                 175

Thr Asn Arg Leu Gly Ser Ala Arg Ser Asp Val Leu Thr Leu Asp Ile
            180                 185                 190

Leu Asp Val Val Thr Thr Asp Pro Pro Pro Asp Val His Val Ser Arg
        195                 200                 205

Val Gly Gly Leu Glu Asp Gln Leu Ser Val Arg Trp Val Ser Pro Pro
    210                 215                 220

Ala Leu Lys Asp Phe Leu Phe Gln Ala Lys Tyr Gln Ile Arg Tyr Arg
225                 230                 235                 240

Val Glu Asp Ser Val Asp Trp Lys Val Val Asp Asp Val Ser Asn Gln
                245                 250                 255

Thr Ser Cys Arg Leu Ala Gly Leu Lys Pro Gly Thr Val Tyr Phe Val
            260                 265                 270

Gln Val Arg Cys Asn Pro Phe Gly Ile Tyr Gly Ser Lys Lys Ala Gly
        275                 280                 285

Ile Trp Ser Glu Trp Ser His Pro Thr Ala Ala Ser Thr Pro Arg
    290                 295                 300



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 



Ala His Thr Ala Val Ile Ser Pro Gln Asp Pro Thr Leu Leu Ile Gly
1               5                   10                  15

Ser Ser Leu Leu Ala Thr Cys Ser Val His Gly Asp Pro Pro Gly Ala
            20                  25                  30

Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn Gly Arg Arg Leu Pro Pro
        35                  40                  45

Glu Leu Ser Arg Val Leu Asn Ala Ser Thr Leu Ala Leu Ala Leu Ala
    50                  55                  60

Asn Leu Asn Gly Ser Arg Gln Arg Ser Gly Asp Asn Leu Val Cys His
65                  70                  75                  80

Ala Arg Asp Gly Ser Ile Leu Ala Gly Ser Cys Leu Tyr Val Gly Leu
                85                  90                  95

Pro Pro Glu Lys Pro Val Asn Ile Ser Cys Trp Ser Lys Asn Met Lys
            100                 105                 110

Asp Leu Thr Cys Arg Trp Thr Pro Gly Ser His Gly Glu Thr Phe Leu
        115                 120                 125

His Thr Asn Tyr Ser Leu Lys Tyr Lys Leu Arg Trp Tyr Gly Gln Asp
    130                 135                 140

Asn Thr Cys Glu Glu Tyr His Thr Val Gly Pro His Ser Cys His Ile
145                 150                 155                 160

Pro Lys Asp Leu Ala Leu Phe Thr Pro Tyr Glu Ile Trp Val Glu Ala
                165                 170                 175

Thr Asn Arg Leu Gly Ser Ala Arg Ser Asp Val Leu Thr Leu Asp Ile
            180                 185                 190

Leu Asp Val Val Thr Thr Asp Pro Pro Pro Asp Val His Val Ser Arg
        195                 200                 205

Val Gly Gly Leu Glu Asp Gln Leu Ser Val Arg Trp Val Ser Pro Pro
    210                 215                 220

Ala Leu Lys Asp Phe Leu Phe Gln Ala Lys Tyr Gln Ile Arg Tyr Arg
225                 230                 235                 240

Val Glu Asp Ser Val Asp Trp Lys Val Val Asp Asp Val Ser Asn Gln
                245                 250                 255

Thr Ser Cys Arg Leu Ala Gly Leu Lys Pro Gly Thr Val Tyr Phe Val
            260                 265                 270

Gln Val Arg Cys Asn Pro Phe Gly Ile Tyr Gly Ser Lys Lys Ala Gly
        275                 280                 285

Ile Trp Ser Glu Trp Ser His Pro Thr Ala Ala Ser Thr Pro Arg Ser
    290                 295                 300

Glu Arg Pro Gly Pro Gly Gly Gly Ala Cys Glu Pro Arg Gly Gly Glu
305                 310                 315                 320

Pro Ser Ser Gly Pro Val Arg Arg Glu Leu Lys Gln Phe Leu Gly Trp
                325                 330                 335

Leu Lys Lys His Ala Tyr Cys Ser Asn Leu Ser Phe Arg Leu Tyr Asp
            340                 345                 350

Gln Trp Arg Ala Trp Met Gln Lys Ser His Lys Thr Arg Asn Gln His
        355                 360                 365

Arg Thr Arg Gly Ser Cys Pro Arg Ala Asp Gly Ala Arg Arg Glu Val
    370                 375                 380

Leu Pro Asp Lys Leu
385





(2) INFORMATION FOR SEQ ID NO:25:



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 389 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:



Ala His Thr Ala Val Ile Ser Pro Gln Asp Pro Thr Leu Leu Ile Gly
1               5                   10                  15

Ser Ser Leu Leu Ala Thr Cys Ser Val His Gly Asp Pro Pro Gly Ala
            20                  25                  30

Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn Gly Arg Arg Leu Pro Pro
        35                  40                  45

Glu Leu Ser Arg Val Leu Asn Ala Ser Thr Leu Ala Leu Ala Leu Ala
    50                  55                  60

Asn Leu Asn Gly Ser Arg Gln Arg Ser Gly Asp Asn Leu Val Cys His
65                  70                  75                  80

Ala Arg Asp Gly Ser Ile Leu Ala Gly Ser Cys Leu Tyr Val Gly Leu
                85                  90                  95

Pro Pro Glu Lys Pro Val Asn Ile Ser Cys Trp Ser Lys Asn Met Lys
            100                 105                 110

Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala His Gly Glu Thr Phe Leu
        115                 120                 125

His Thr Asn Tyr Ser Leu Lys Tyr Lys Leu Arg Trp Tyr Gly Gln Asp
    130                 135                 140

Asn Thr Cys Glu Asp Tyr His Thr Val Gly Pro His Ser Cys His Ile
145                 150                 155                 160

Pro Lys Asp Leu Ala Leu Phe Thr Pro Tyr Glu Ile Trp Val Glu Ala
                165                 170                 175

Thr Asn Arg Leu Gly Ser Ala Arg Ser Asp Val Leu Thr Leu Asp Ile
            180                 185                 190

Leu Asp Val Val Thr Thr Asp Pro Pro Pro Asp Val His Val Ser Arg
        195                 200                 205

Val Gly Gly Leu Glu Asp Gln Leu Ser Val Arg Trp Val Ser Pro Pro
    210                 215                 220

Ala Leu Lys Asp Phe Leu Phe Gln Ala Lys Tyr Gln Ile Arg Tyr Arg
225                 230                 235                 240

Val Glu Asp Ser Val Asp Trp Lys Val Val Asp Asp Val Ser Asn Gln
                245                 250                 255

Thr Ser Cys Arg Leu Ala Gly Leu Lys Pro Gly Thr Val Tyr Phe Val
            260                 265                 270

Gln Val Arg Cys Asn Pro Phe Gly Ile Tyr Gly Ser Lys Lys Ala Gly
        275                 280                 285

Ile Trp Ser Glu Trp Ser His Pro Thr Ala Ala Ser Thr Pro Arg Ser
    290                 295                 300

Glu Arg Pro Gly Pro Gly Gly Gly Ala Cys Glu Pro Arg Gly Gly Glu
305                 310                 315                 320

Pro Ser Ser Gly Pro Val Arg Arg Glu Leu Lys Gln Phe Leu Gly Trp
                325                 330                 335

Leu Lys Lys His Ala Tyr Cys Ser Asn Leu Ser Phe Arg Leu Tyr Asp
            340                 345                 350

Gln Trp Arg Ala Trp Met Gln Lys Ser His Lys Thr Arg Asn Gln His
        355                 360                 365

Arg Thr Arg Gly Ser Cys Pro Arg Ala Asp Gly Ala Arg Arg Glu Val
    370                 375                 380

Leu Pro Asp Lys Leu
385



(2) INFORMATION FOR SEQ ID NO:26:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



Ala His Thr Ala Val Ile Ser Pro Gln Asp Pro Thr Leu Leu Ile Gly
1               5                   10                  15

Ser Ser Leu Leu Ala Thr Cys Ser Val His Gly Asp Pro Pro Gly Ala
            20                  25                  30

Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn Gly Arg Arg Leu Pro Pro
        35                  40                  45

Glu Leu Ser Arg Val Leu Asn Ala Ser Thr Leu Ala Leu Ala Leu Ala
    50                  55                  60

Asn Leu Asn Gly Ser Arg Gln Arg Ser Gly Asp Asn Leu Val Cys His
65                  70                  75                  80

Ala Arg Asp Gly Ser Ile Leu Ala Gly Ser Cys Leu Tyr Val Gly Leu
                85                  90                  95

Pro Pro Glu Lys Pro Val Asn Ile Ser Cys Trp Ser Lys Asn Met Lys
            100                 105                 110

Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala His Gly Glu Thr Phe Leu
        115                 120                 125

His Thr Asn Tyr Ser Leu Lys Tyr Lys Leu Arg Trp Tyr Gly Gln Asp
    130                 135                 140

Asn Thr Cys Glu Glu Tyr His Thr Val Gly Pro His Ser Cys His Ile
145                 150                 155                 160

Pro Lys Asp Leu Thr Leu Phe Thr Pro Tyr Glu Ile Trp Val Glu Ala
                165                 170                 175

Thr Asn Arg Leu Gly Ser Ala Arg Ser Asp Val Leu Thr Leu Asp Ile
            180                 185                 190

Leu Asp Val Val Thr Thr Asp Pro Pro Pro Asp Val His Val Ser Arg
        195                 200                 205

Val Gly Gly Leu Glu Asp Gln Leu Ser Val Arg Trp Val Ser Pro Pro
    210                 215                 220

Ala Leu Lys Asp Phe Leu Phe Gln Ala Lys Tyr Gln Ile Arg Tyr Arg
225                 230                 235                 240

Val Glu Asp Ser Val Asp Trp Lys Val Val Asp Asp Val Ser Asn Gln
                245                 250                 255

Thr Ser Cys Arg Leu Ala Gly Leu Lys Pro Gly Thr Val Tyr Phe Val
            260                 265                 270

Gln Val Arg Cys Asn Pro Phe Gly Ile Tyr Gly Ser Lys Lys Ala Gly
        275                 280                 285

Ile Trp Ser Glu Trp Ser His Pro Thr Ala Ala Ser Thr Pro Arg Ser
    290                 295                 300

Glu Arg Pro Gly Pro Gly Gly Gly Ala Cys Glu Pro Arg Gly Gly Glu
305                 310                 315                 320

Pro Ser Ser Gly Pro Val Arg Arg Glu Leu Lys Gln Phe Leu Gly Trp
                325                 330                 335

Leu Lys Lys His Ala Tyr Cys Ser Asn Leu Ser Phe Arg Leu Tyr Asp
            340                 345                 350

Gln Trp Arg Ala Trp Met Gln Lys Ser His Lys Thr Arg Asn Gln His
        355                 360                 365

Arg Thr Arg Gly Ser Cys Pro Arg Ala Asp Gly Ala Arg Arg Glu Val
    370                 375                 380

Leu Pro Asp Lys Leu
385



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
Ala His Thr Ala Val Ile Ser Pro Gln Asp Pro Thr Leu Leu Ile Gly
1               5                   10                  15

Ser Ser Leu Leu Ala Thr Cys Ser Val His Gly Asp Pro Pro Gly Ala
            20                  25                  30

Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn Gly Arg Arg Leu Pro Pro
        35                  40                  45

Glu Leu Ser Arg Val Leu Asn Ala Ser Thr Leu Ala Leu Ala Leu Ala
    50                  55                  60

Asn Leu Asn Gly Ser Arg Gln Arg Ser Gly Asp Asn Leu Val Cys His
65                  70                  75                  80

Ala Arg Asp Gly Ser Ile Leu Ala Gly Ser Cys Leu Tyr Val Gly Leu
                85                  90                  95

Pro Pro Glu Lys Pro Val Asn Ile Ser Cys Trp Ser Lys Asn Met Lys
            100                 105                 110

Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala His Gly Glu Thr Phe Leu
        115                 120                 125

His Thr Asn Tyr Ser Leu Lys Tyr Lys Leu Arg Trp Tyr Gly Gln Asp
    130                 135                 140

Asn Thr Cys Glu Glu Tyr His Thr Val Gly Pro His Ser Cys His Ile
145                 150                 155                 160

Pro Lys Asp Leu Ala Leu Phe Thr Pro Tyr Glu Ile Trp Val Glu Ala
                165                 170                 175

Thr Asn Arg Leu Gly Ser Ser Arg Ser Asp Val Leu Thr Leu Asp Ile
            180                 185                 190

Leu Asp Val Val Thr Thr Asp Pro Pro Pro Asp Val His Val Ser Arg
        195                 200                 205

Val Gly Gly Leu Glu Asp Gln Leu Ser Val Arg Trp Val Ser Pro Pro
    210                 215                 220

Ala Leu Lys Asp Phe Leu Phe Gln Ala Lys Tyr Gln Ile Arg Tyr Arg
225                 230                 235                 240

Val Glu Asp Ser Val Asp Trp Lys Val Val Asp Asp Val Ser Asn Gln
                245                 250                 255

Thr Ser Cys Arg Leu Ala Gly Leu Lys Pro Gly Thr Val Tyr Phe Val
            260                 265                 270

Gln Val Arg Cys Asn Pro Phe Gly Ile Tyr Gly Ser Lys Lys Ala Gly
        275                 280                 285

Ile Trp Ser Glu Trp Ser His Pro Thr Ala Ala Ser Thr Pro Arg Ser
    290                 295                 300

Glu Arg Pro Gly Pro Gly Gly Gly Ala Cys Glu Pro Arg Gly Gly Glu
305                 310                 315                 320

Pro Ser Ser Gly Pro Val Arg Arg Glu Leu Lys Gln Phe Leu Gly Trp
                325                 330                 335

Leu Lys Lys His Ala Tyr Cys Ser Asn Leu Ser Phe Arg Leu Tyr Asp
            340                 345                 350

Gln Trp Arg Ala Trp Met Gln Lys Ser His Lys Thr Arg Asn Gln His
        355                 360                 365

Arg Thr Arg Gly Ser Cys Pro Arg Ala Asp Gly Ala Arg Arg Glu Val
    370                 375                 380

Leu Pro Asp Lys Leu
385

(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 389 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:



Ala His Thr Ala Val Ile Ser Pro Gln Asp Pro Thr Leu Leu Ile Gly
1               5                   10                  15

Ser Ser Leu Leu Ala Thr Cys Ser Val His Gly Asp Pro Pro Gly Ala
            20                  25                  30

Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn Gly Arg Arg Leu Pro Pro
        35                  40                  45

Glu Leu Ser Arg Val Leu Asn Ala Ser Thr Leu Ala Leu Ala Leu Ala
    50                  55                  60

Asn Leu Asn Gly Ser Arg Gln Arg Ser Gly Asp Asn Leu Val Cys His
65                  70                  75                  80

Ala Arg Asp Gly Ser Ile Leu Ala Gly Ser Cys Leu Tyr Val Gly Leu
                85                  90                  95

Pro Pro Glu Lys Pro Val Asn Ile Ser Cys Trp Ser Lys Asn Met Lys
            100                 105                 110

Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala His Gly Glu Thr Phe Leu
        115                 120                 125

His Thr Asn Tyr Ser Leu Lys Tyr Lys Leu Arg Trp Tyr Gly Gln Asp
    130                 135                 140

Asn Thr Cys Glu Glu Tyr His Thr Val Gly Pro His Ser Cys His Ile
145                 150                 155                 160

Pro Lys Asp Leu Ala Leu Phe Thr Pro Tyr Glu Ile Trp Val Glu Ala
                165                 170                 175

Thr Asn Arg Leu Gly Ser Ala Arg Ser Asp Val Leu Thr Leu Asp Val
            180                 185                 190

Leu Asp Val Val Thr Thr Asp Pro Pro Pro Asp Val His Val Ser Arg
        195                 200                 205

Val Gly Gly Leu Glu Asp Gln Leu Ser Val Arg Trp Val Ser Pro Pro
    210                 215                 220

Ala Leu Lys Asp Phe Leu Phe Gln Ala Lys Tyr Gln Ile Arg Tyr Arg
225                 230                 235                 240

Val Glu Asp Ser Val Asp Trp Lys Val Val Asp Asp Val Ser Asn Gln
                245                 250                 255

Thr Ser Cys Arg Leu Ala Gly Leu Lys Pro Gly Thr Val Tyr Phe Val
            260                 265                 270

Gln Val Arg Cys Asn Pro Phe Gly Ile Tyr Gly Ser Lys Lys Ala Gly
        275                 280                 285

Ile Trp Ser Glu Trp Ser His Pro Thr Ala Ala Ser Thr Pro Arg Ser
    290                 295                 300

Glu Arg Pro Gly Pro Gly Gly Gly Ala Cys Glu Pro Arg Gly Gly Glu
305                 310                 315                 320

Pro Ser Ser Gly Pro Val Arg Arg Glu Leu Lys Gln Phe Leu Gly Trp
                325                 330                 335

Leu Lys Lys His Ala Tyr Cys Ser Asn Leu Ser Phe Arg Leu Tyr Asp
            340                 345                 350

Gln Trp Arg Ala Trp Met Gln Lys Ser His Lys Thr Arg Asn Gln His
        355                 360                 365

Arg Thr Arg Gly Ser Cys Pro Arg Ala Asp Gly Ala Arg Arg Glu Val
    370                 375                 380

Leu Pro Asp Lys Leu
385



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:

Ala His Thr Ala Val Ile Ser Pro Gln Asp Pro Thr Leu Leu Ile Gly
1               5                   10                  15

Ser Ser Leu Leu Ala Thr Cys Ser Val His Gly Asp Pro Pro Gly Ala
            20                  25                  30

Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn Gly Arg Arg Leu Pro Pro
        35                  40                  45

Glu Leu Ser Arg Val Leu Asn Ala Ser Thr Leu Ala Leu Ala Leu Ala
    50                  55                  60

Asn Leu Asn Gly Ser Arg Gln Arg Ser Gly Asp Asn Leu Val Cys His
65                  70                  75                  80

Ala Arg Asp Gly Ser Ile Leu Ala Gly Ser Cys Leu Tyr Val Gly Leu
                85                  90                  95

Pro Pro Glu Lys Pro Val Asn Ile Ser Cys Trp Ser Lys Asn Met Lys
            100                 105                 110

Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala His Gly Glu Thr Phe Leu
        115                 120                 125

His Thr Asn Tyr Ser Leu Lys Tyr Lys Leu Arg Trp Tyr Gly Gln Asp
    130                 135                 140

Asn Thr Cys Glu Glu Tyr His Thr Val Gly Pro His Ser Cys His Ile
145                 150                 155                 160

Pro Lys Asp Leu Ala Leu Phe Thr Pro Tyr Glu Ile Trp Val Glu Ala
                165                 170                 175

Thr Asn Arg Leu Gly Ser Ala Arg Ser Asp Val Leu Thr Leu Asp Ile
            180                 185                 190

Leu Asp Ile Val Thr Thr Asp Pro Pro Pro Asp Val His Val Ser Arg
        195                 200                 205

Val Gly Gly Leu Glu Asp Gln Leu Ser Val Arg Trp Val Ser Pro Pro
    210                 215                 220

Ala Leu Lys Asp Phe Leu Phe Gln Ala Lys Tyr Gln Ile Arg Tyr Arg
225                 230                 235                 240

Val Glu Asp Ser Val Asp Trp Lys Val Val Asp Asp Val Ser Asn Gln
                245                 250                 255

Thr Ser Cys Arg Leu Ala Gly Leu Lys Pro Gly Thr Val Tyr Phe Val
            260                 265                 270

Gln Val Arg Cys Asn Pro Phe Gly Ile Tyr Gly Ser Lys Lys Ala Gly
        275                 280                 285

Ile Trp Ser Glu Trp Ser His Pro Thr Ala Ala Ser Thr Pro Arg Ser
    290                 295                 300

Glu Arg Pro Gly Pro Gly Gly Gly Ala Cys Glu Pro Arg Gly Gly Glu
305                 310                 315                 320

Pro Ser Ser Gly Pro Val Arg Arg Glu Leu Lys Gln Phe Leu Gly Trp
                325                 330                 335

Leu Lys Lys His Ala Tyr Cys Ser Asn Leu Ser Phe Arg Leu Tyr Asp
            340                 345                 350

Gln Trp Arg Ala Trp Met Gln Lys Ser His Lys Thr Arg Asn Gln His
        355                 360                 365

Arg Thr Arg Gly Ser Cys Pro Arg Ala Asp Gly Ala Arg Arg Glu Val
    370                 375                 380

Leu Pro Asp Lys Leu
385



(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 389 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



Ala 


His 


Thr 


Ala 


Val 


He 


Ser 


Pro 


Gin 


Asp 


Pro 


Thr Leu Leu He Gly 


1 








5 










10 




15 


Ser 


Ser 


Leu 


Leu 


Ala 


Thr 


Cys 


Ser 


Val 


His 


Gly 


Asp Pro Pro Gly Ala 








20 










25 






30 


Thr 


Ala 


Glu 


Gly 


Leu 


Tyr 


Trp 


Thr 


Leu 


Asn 


Gly 


Arg Arg Leu Pro Pro 






35 










40 








45 


Glu 


Leu 


Ser 


Arg 


Val 


Leu 


Asn 


Ala 


Ser 


Thr 


Leu 


Ala Leu Ala Leu Ala 




50 










55 










60 


Asn 


Leu 


Asn 


Gly 


Ser 


Arg 


Gin 


Arg 


Ser 


Gly 


Asp 


Asn Leu Val Cys His 


65 










70 










75 


80 


Ala 


Arg 


Asp 


Gly 


Ser 


He 


Leu 


Ala 


Gly 


Ser 


Cys 


Leu Tyr Val Gly Leu 










85 










90 




95 


Pro 


Pro 


Glu 


Lys 


Pro 


Val 


Asn 


He 


Ser 


Cys 


Trp 


Ser Lys Asn Met Lys 








100 










105 






110 


Asp 


Leu 


Thr 


Cys 


Arg 


Trp 


Thr 


Pro 


Gly 


Ala 


His 


Gly Glu Thr Phe Leu 






lib 










120 








125 


His 


Thr 


Asn 


Tyr 


Ser 


Leu 


Lys 


Tyr 


Lys 


Leu 


Arg 


Trp Tyr Gly Gin Asp 




130 










135 










140 


Asn 


Thr 


Cys 


Glu 


Glu 


Tyr 


His 


Thr 


Val 


Gly 


Pro 


His Ser Cys His He 


145 










150 










155 


160 


Pro 


Lys 


Asp 


Leu 


Ala 


Leu 


Phe 


Thr 


Pro 


Tyr 


Glu 


He Trp Val Glu Ala 










165 










170 




175 


Thr 


Asn 


Arg 


Leu 


Gly 


Ser 


Ala 


Arg 


Ser 


Asp 


Val 


Leu Thr Leu Asp He 








180 










185 






190 


Leu 


Asp 


Val 


Val 


Thr 


Thr 


Asp 


Pro 


Pro 


Pro 


Asp 


Val His Val Ser Arg 






195 










200 








205 


Val 


Gly 


Gly 


Leu 


Glu 


Asp 


Gin 


Leu 


Ser 


Val 


Arg 


Trp Val Ser Pro Pro 




210 










215 








220 


Ala 


Leu 


Lys 


Asp 


Phe 


Leu 


Phe 


Gin 


Ala 


Lys 


Tyr 


Gin He Arg Tyr Arg 


225 










230 










235 


240 


Val 


Glu 


Asp 


Ser 


Val 


Asp 


Trp 


Lys 


Val 


Val 


Asp 


Asp Val Ser Asn Gin 










245 










250 




255 


Thr Ser Cys Arg Ile Ala Gly Leu Lys Pro Gly Thr Val Tyr Phe Val
260 265 270
Gln


va 1 


arg 


Pwc 

uys 


Asn 


pro 


rne 


bly 


I le 


Tyr Gly Ser Lys Lys Ala 


Gly 




















00c 




Ile Trp Ser Glu Trp Ser His Pro Thr Ala Ala Ser Thr Pro Arg Ser
290 295 300




Glu Arg Pro Gly Pro Gly Gly Gly Ala Cys Glu Pro Arg Gly Gly Glu
305 310 315 320


Pro Ser Ser Gly Pro Val Arg Arg Glu Leu Lys Gln Phe Leu Gly Trp
325 330 335


Leu Lys Lys His Ala Tyr Cys Ser Asn Leu Ser Phe Arg Leu Tyr Asp
340 345 350


Gln Trp Arg Ala Trp Met Gln Lys Ser His Lys Thr Arg Asn Gln His
355 360 365




Arg Thr Arg Gly Ser Cys Pro Arg Ala Asp Gly Ala Arg Arg Glu Val
370 375 380




Leu Pro Asp Lys Leu
385



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:



Ala His Thr Ala Val Ile Ser Pro Gln Asp Pro Thr Leu Leu Ile Gly
1 5 10 15


Ser Ser Leu Leu Ala Thr Cys Ser Val His Gly Asp Pro Pro Gly Ala
20 25 30




Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn Gly Arg Arg Leu Pro Pro
35 40 45




Glu Leu Ser Arg Val Leu Asn Ala Ser Thr Leu Ala Leu Ala Leu Ala
50 55 60




Asn Leu Asn Gly Ser Arg Gln Arg Ser Gly Asp Asn Leu Val Cys His
65 70 75 80


Ala Arg Asp Gly Ser Ile Leu Ala Gly Ser Cys Leu Tyr Val Gly Leu
85 90 95




Pro Pro Glu Lys Pro Val Asn Ile Ser Cys Trp Ser Lys Asn Met Lys
100 105 110


Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala His Gly Glu Thr Phe Leu
115 120 125
His Thr Asn Tyr Ser Leu Lys Tyr Lys Leu Arg Trp Tyr Gly Gln Asp
130 135 140


Mol 1 


Thr 
I f ll 


uyb 


u 1 U 


b 1 u 


Iyr 


Ml S 


i nr 


va i 


PI w Dnr\ 

bly pro 


his ber Lys His lie 


LHD 










lOU 








loo 


160 


Pro Lys Asp Leu Ala Leu Phe Thr Pro Tyr Glu Ile Trp Val Glu Ala
165 170 175


Thr Asn Arg Leu Gly Ser Ala Arg Ser Asp Val Leu Thr Leu Asp Ile
180 185 190


Leu Asp Val Val Thr Thr Asp Pro Pro Pro Asp Val His Val Ser Arg
195 200 205


Val Gly Gly Leu Glu Asp Gln Leu Ser Val Arg Trp Val Ser Pro Pro
210 215 220


Ala Leu Lys Asp Phe Leu Phe Gln Ala Lys Tyr Gln Ile Arg Tyr Arg
225 230 235 240


Val Glu Asp Ser Val Asp Trp Lys Val Val Asp Asp Val Ser Asn Gln
245 250 255


i nr 


Ser 


rue 

^ys 


Arg 


Leu 


1 le 


PT » i 

bly 


Leu 


Lys 


Pro Gly 


Thr Val Tyr Phe Val 


















£00 




270 


Gln Val Arg Lys Asn Pro Phe Gly Ile Tyr Gly Ser Lys Lys Ala Gly
275 280 285


Ile Trp Ser Glu Trp Ser His Pro Thr Ala Ala Ser Thr Pro Arg Ser
290 295 300


Glu Arg Pro Gly Pro Gly Gly Gly Ala Cys Glu Pro Arg Gly Gly Glu
305 310 315 320


Pro Ser Ser Gly Pro Val Arg Arg Glu Leu Lys Gln Phe Leu Gly Trp
325 330 335


Leu Lys Lys His Ala Tyr Cys Ser Asn Leu Ser Phe Arg Leu Tyr Asp
340 345 350


Gln Trp Arg Ala Trp Met Gln Lys Ser His Lys Thr Arg Asn Gln His
355 360 365


Arg Thr Arg Gly Ser Cys Pro Arg Ala Asp Gly Ala Arg Arg Glu Val
370 375 380


Leu Pro Asp Lys Leu
385
(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

Asn Ser Ala Arg Gly Ala Cys Val Pro Arg Arg Ala Pro Pro Pro Pro
1 5 10 15

Ser Arg Ser Pro Pro Arg Ala Pro Gly Ser Ala Gly Pro Met Pro Ala
20 25 30

Gly Pro Met Pro Ala Gly Arg Arg Gly Pro Ala Ala Gln Ser Ala Arg
35 40 45

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Asn Ser Ala Arg Gly Ala Cys Val Pro Arg Arg Ala Pro Pro Pro Pro 
1 5 10 15

Ser Arg Ser Pro Pro Arg Ala Pro Gly Ser 
20 25 

(2) INFORMATION FOR SEQ ID NO: 34:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:

Ala Gly Pro Met Pro Ala Gly Pro Met Pro Ala Gly Arg Arg Gly Pro
1 5 10 15

Ala Ala Gln Ser Ala Arg
20

(2) INFORMATION FOR SEQ ID NO: 35: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Ser Pro Pro Ala Leu Lys Asp Phe Leu Phe Gln Ala Lys Tyr Gln Ile
1 5 10 15

Arg Tyr Arg Val Glu Asp Ser Val Asp Trp Lys Val Val Asp Asp Val 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Ala Ala Ser Thr Pro Arg Ser Glu Arg Pro Gly Pro Gly Gly Gly Ala 
1 5 10 15 

Cys Glu Pro Arg Gly Gly Glu Pro Ser Ser Gly Pro Val Arg Arg 
20 25 30 

(2) INFORMATION FOR SEQ ID NO:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Trp Met Gln Lys Ser His Lys Thr Arg Asn Gln Asp Glu Gly Ile Leu
1 5 10 15

Pro Ser Gly Arg Arg Gly Thr Ala Arg Gly Pro Ala Arg 
20 25 
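As an illustration only, the declared LENGTH of a listed sequence can be checked against its residues. The sketch below is not part of the application as filed; it assumes the listing uses the standard three-letter amino acid codes, and the names THREE_TO_ONE, parse_listing and SEQ_ID_33 are hypothetical helpers introduced here. It converts SEQ ID NO: 33 to one-letter code and counts the residues.

```python
# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def parse_listing(text):
    """Return the one-letter sequence for a listing block.

    Position numbers (1, 5, 10, ...) are skipped. An unknown token
    raises KeyError, which flags a garbled residue.
    """
    residues = []
    for token in text.split():
        if token.isdigit():
            continue  # position marker, not a residue
        residues.append(THREE_TO_ONE[token])
    return "".join(residues)


# SEQ ID NO: 33 as listed above (declared LENGTH: 26 amino acids).
SEQ_ID_33 = """
Asn Ser Ala Arg Gly Ala Cys Val Pro Arg Arg Ala Pro Pro Pro Pro
1 5 10 15
Ser Arg Ser Pro Pro Arg Ala Pro Gly Ser
20 25
"""

seq33 = parse_listing(SEQ_ID_33)
print(seq33, len(seq33))
```

The same check can be applied to any other sequence of the listing by pasting its rows into a string; a count that differs from the declared LENGTH points to a transcription problem rather than to a property of the sequence.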
CLAIMS
We claim: 

1. An isolated polynucleotide which encodes a
mammalian polypeptide, said polypeptide being comprised of an
amino acid sequence selected from the group consisting of SEQ
ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, and SEQ ID NOs: 17-37,
or a polypeptide which is 90%, 95% or 99% identical to said
amino acid sequences.

2. The isolated polynucleotide of claim 1 wherein
said polynucleotide is a DNA sequence.

3. The isolated polynucleotide of claim 1 wherein
said polynucleotide is an RNA sequence.

4. An expression vector comprising the following
operably linked elements:

a transcription promoter;

a DNA segment encoding a mammalian polypeptide,
said polypeptide being comprised of an amino acid sequence
selected from the group consisting of SEQ ID NO: 2, SEQ ID NO:
4, SEQ ID NO: 6, SEQ ID NOs: 17-37; and

a transcription terminator.

5. An isolated polypeptide, said polypeptide being
comprised of an amino acid sequence selected from the group
consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, and
SEQ ID NOs: 17-31, or a polypeptide which is 90%, 95% or 99%
identical to said amino acid sequences.

6. A peptide or polypeptide which has the amino acid
sequence of an epitope-bearing portion of a Zcytor5
polypeptide.

7. A polypeptide of claim 6 wherein the polypeptide
has an amino acid sequence of at least 15 amino acid residues.

8. The polypeptide of claim 7 wherein said
polypeptide is selected from the group of polypeptides
consisting of the amino acid sequences of SEQ ID NOs: 32-37.

9. An antibody which specifically binds to an
epitope-binding sequence of a Zcytor5 polypeptide.

10. An antibody of claim 9 wherein said antibody
binds to a polypeptide comprised of an amino acid sequence
selected from the group consisting of SEQ ID NO: 2, SEQ ID NO:
4, SEQ ID NO: 6, and SEQ ID NOs: 17-37.

11. An anti-idiotypic antibody of an antibody of
claim 9 or 10.

12. A method for producing an antibody which binds
to a Zcytor5 polypeptide comprising inoculating an animal with
an epitope-bearing amino acid sequence of a Zcytor5 polypeptide
under conditions wherein said animal produces antibodies which
bind to the Zcytor5 polypeptide; and

isolating said antibodies.
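
For illustration only, the identity thresholds recited in claims 1 and 5 (90%, 95% or 99%) can be sketched as a simple calculation. The sketch is not part of the claims and makes a simplifying assumption: the two sequences are already aligned, of equal length and free of gaps, so identity is the fraction of matching positions. The alignment method and scoring parameters that the specification relies on are not given in this section and are not reproduced here. The names percent_identity, REFERENCE and VARIANT are hypothetical, and the variant sequence is invented for the example.

```python
def percent_identity(a, b):
    """Percent of identical positions in two pre-aligned, ungapped sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not a:
        raise ValueError("sequences must be non-empty")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


# SEQ ID NO: 34 in one-letter code (22 residues).
REFERENCE = "AGPMPAGPMPAGRRGPAAQSAR"
# Hypothetical variant with one substitution (Arg at position 13 to Lys).
VARIANT = "AGPMPAGPMPAGKRGPAAQSAR"

identity = percent_identity(REFERENCE, VARIANT)
for threshold in (90, 95, 99):
    print(threshold, identity >= threshold)
```

With one mismatch in 22 positions the hypothetical variant is 21/22 identical, so under this simplified measure it passes the 90% and 95% thresholds but not the 99% threshold.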